This is Swadesh Swain

about
publications
projects
activities
cv

Announcement_17

Created on March 31, 2026

2026

Our new paper “CAP: Counterfactual Activation Potential for Quantifying Suppressed Safety Features in Language Models” has been submitted to COLM 2026!

© Copyright 2026 Swadesh Swain. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.