Attribution
Representation Noising: A Defence Mechanism Against Harmful Finetuning
arXiv 2024
Carsten Maple
Domenic Rosati
Frank Rudzicz
Hassan Sajjad
Jan Wehner
Kai Williams
Łukasz Bartoszcze
Robie Gonzales
Subhabrata Majumdar