Neil Gong
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Safety at Scale: A Comprehensive Survey of Large Model Safety
arXiv 2025
PLeak: Prompt Leaking Attacks against Large Language Model Applications
arXiv 2024
GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis
arXiv 2024
AI-generated Image Detection: Passive or Watermark?
arXiv 2024
SneakyPrompt: Jailbreaking Text-to-image Generative Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers