Cite
Notes
Only stored in your browser.
Attribution
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning
arXiv 2024
from 1 papers
Gautam Bhattacharya
Josh Kimball
Ling Liu
Tiansheng Huang