Yinpeng Dong

Papers: 10

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

10papers

Authored papers

STAIR: Improving Safety Alignment with Introspective Reasoning

arXiv 2025

2025

Safety at Scale: A Comprehensive Survey of Large Model Safety

arXiv 2025

2025

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

arXiv 2024

2024

GNOT: A General Neural Operator Transformer for Operator Learning

arXiv 2023

2023

Evil Geniuses: Delving into the Safety of LLM-based Agents

arXiv 2023

2023

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models

arXiv 2023

2023

Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning

arXiv 2023

2023

Rethinking Model Ensemble in Transfer-based Adversarial Attacks

arXiv 2023

2023

Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

arXiv 2023

2023

Adversarial Attacks and Defences Competition

arXiv 2018

2018

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Hang Su

Jun Zhu

Xiao Yang

Cihang Xie

Huanran Chen

Qingni Shen

Shengfang Zhai

Tianyu Pang

Tongliang Liu

Yang Liu