Cite
Notes
Only stored in your browser.
Attribution
SOM Directions are Better than One: Multi-Directional Refusal Suppression in Language Models
arXiv 2025
from 1 papers
Battista Biggio
Fabio Brau
Fabio Roli
Giorgio Piras
Raffaele Mura