Cite
Notes
Only stored in your browser.
Attribution
Safety Alignment Should Be Made More Than Just a Few Tokens Deep
arXiv 2024
from 1 papers
Ahmad Beirami
Ashwinee Panda
Kaifeng Lyu
Peter Henderson
Prateek Mittal
Xiangyu Qi
Xiao Ma