Yixu Wang
Papers: 10
Authored papers
OpenRT: An Open-Source Red Teaming Framework for Multimodal LLMs
arXiv 2026
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
arXiv 2026
Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs
arXiv 2025
BackdoorVLM: A Benchmark for Backdoor Attacks on Vision-Language Models
arXiv 2025
Safety at Scale: A Comprehensive Survey of Large Model Safety
arXiv 2025
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
arXiv 2024
Reflection-Bench: probing AI intelligence with reflection
arXiv 2024
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
arXiv 2024
Flames: Benchmarking Value Alignment of LLMs in Chinese
arXiv 2023
Fake Alignment: Are LLMs Really Aligned Well?
arXiv 2023