Javier Rando

Papers: 7

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

7papers

Authored papers

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

arXiv 2026

2026

Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations

arXiv 2024

2024

Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs

arXiv 2024

2024

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

arXiv 2024

2024

PassGPT: Password Modeling and (Guided) Generation with Large Language Models

arXiv 2023

2023

Universal Jailbreak Backdoors from Poisoned Human Feedback

arXiv 2023

2023

"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Florian Tramer

3 shared papers

Ahmed Salem

1 shared paper

Andy Zou

founder

Arman Zharmagambetov

Benjamin L. Edelman

Briland Hitaj

Chenhao Li

Daniel Paleka

Dragos Albastroiu

Edoardo Debenedetti