Matt Fredrikson
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems
arXiv 2026
How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition
arXiv 2026
Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
arXiv 2025
Improving Alignment and Robustness with Circuit Breakers
arXiv 2024
AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents
arXiv 2024
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
arXiv 2024
Universal and Transferable Adversarial Attacks on Aligned Language Models
arXiv 2023
Representation Engineering: A Top-Down Approach to AI Transparency
arXiv 2023
Unlocking Deterministic Robustness Certification on ImageNet
unlocking-deterministic-robustness
Affiliations
Frequent co-authors
10from 9 papers