Matt Fredrikson

Papers: 9

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems

arXiv 2026

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

arXiv 2026

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

arXiv 2025

Improving Alignment and Robustness with Circuit Breakers

arXiv 2024

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

arXiv 2024

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents

arXiv 2024

Universal and Transferable Adversarial Attacks on Aligned Language Models

arXiv 2023

Representation Engineering: A Top-Down Approach to AI Transparency

arXiv 2023

Unlocking Deterministic Robustness Certification on ImageNet

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Andy Zou

founder

7 shared papers

Zifan Wang

5 shared papers

Dan Hendrycks

director

3 shared papers

J. Zico Kolter

3 shared papers

Maxwell Lin

3 shared papers

Zico Kolter

professor

Derek Duenas

Eric Winsor

Justin Wang

Long Phan

researcher

2 shared papers