Attribution
Obfuscated Activations Bypass LLM Latent-Space Defenses
arXiv 2024
Abhay Sheshadri
Alex Serrano
Carlos Guestrin
Jacob Hilton
researcher
Jordan Taylor
Luke Bailey
Mikhail Seleznyov
Scott Emmons
Stephen Casper