Yinpeng Dong
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Safety at Scale: A Comprehensive Survey of Large Model Safety
arXiv 2025
STAIR: Improving Safety Alignment with Introspective Reasoning
arXiv 2025
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
arXiv 2024
Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
arXiv 2023
GNOT: A General Neural Operator Transformer for Operator Learning
arXiv 2023
Evil Geniuses: Delving into the Safety of LLM-based Agents
arXiv 2023
Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models
arXiv 2023
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning
arXiv 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
arXiv 2023
Adversarial Attacks and Defences Competition
arXiv 2018
Affiliations
Frequent co-authors
10from 10 papers