Xiao Yang
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19Helios: Real Real-Time Long Video Generation Model
arXiv 2026
LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model
arXiv 2026
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders
arXiv 2026
STAIR: Improving Safety Alignment with Introspective Reasoning
arXiv 2025
IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
arXiv 2025
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
arXiv 2024
CRAG -- Comprehensive RAG Benchmark
arXiv 2024
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
arXiv 2024
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
arXiv 2024
MVDream: Multi-view Diffusion for 3D Generation
arXiv 2023
On Evaluating Adversarial Robustness of Large Vision-Language Models
NeurIPS 2023 11
A Recipe for Watermarking Diffusion Models
arXiv 2023
Evil Geniuses: Delving into the Safety of LLM-based Agents
arXiv 2023
MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
arXiv 2023
Rethinking Model Ensemble in Transfer-based Adversarial Attacks
arXiv 2023
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
dab-detr-dynamic-anchor-boxes-are-better
Shifted Diffusion for Text-to-image Generation
CVPR 2023 1
Robustness and Accuracy Could Be Reconcilable by (Proper) Definition
arXiv 2022
Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis
arXiv 2021
Affiliations
Frequent co-authors
10from 19 papers