Cite
Notes
Only stored in your browser.
Attribution
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models
arXiv 2024
from 1 papers
Bo Li
Dawn Song
professor
Ruoxi Jia
Weiyu Sun
Yi Zeng