Xiangyu Qi
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition
arXiv 2026
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
arXiv 2024
Safety Alignment Should Be Made More Than Just a Few Tokens Deep
arXiv 2024
On Evaluating the Durability of Safeguards for Open-Weight LLMs
arXiv 2024
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
arXiv 2023
Visual Adversarial Examples Jailbreak Aligned Large Language Models
arXiv 2023
Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
CVPR 2022 1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers