Cite
Notes
Only stored in your browser.
Attribution
PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails
arXiv 2024
from 1 papers
Ashish Hooda
Atul Prakash
Jihye Choi
Kassem Fawaz
Neal Mangaokar
Somesh Jha