Shenghai Yuan
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Helios: Real Real-Time Long Video Generation Model
arXiv 2026
ImgEdit: A Unified Image Editing Dataset and Benchmark
arXiv 2025
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
arXiv 2025
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
arXiv 2025
MAGREF: Masked Guidance for Any-Reference Video Generation
arXiv 2025
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
arXiv 2025
MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation
arXiv 2025
Sci-Fi: Symmetric Constraint for Frame Inbetweening
arXiv 2025
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
arXiv 2025
Open-Sora Plan: Open-Source Large Video Generation Model
arXiv 2024
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
CVPR 2025 1
MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats
arXiv 2024
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
arXiv 2024
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
CVPR 2025 1
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
arXiv 2024
AirSLAM: An Efficient and Illumination-Robust Point-Line Visual SLAM System
arXiv 2024
Affiliations
Frequent co-authors
10from 16 papers