Javier Rando
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition
arXiv 2026
Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations
arXiv 2024
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
arXiv 2024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
arXiv 2024
PassGPT: Password Modeling and (Guided) Generation with Large Language Models
arXiv 2023
Universal Jailbreak Backdoors from Poisoned Human Feedback
arXiv 2023
"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers