Yang Ye
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9ImgEdit: A Unified Image Editing Dataset and Benchmark
arXiv 2025
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
arXiv 2025
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
arXiv 2025
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft
arXiv 2025
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
arXiv 2025
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
arXiv 2024
Open-Sora Plan: Open-Source Large Video Generation Model
arXiv 2024
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
CVPR 2025 1
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
video-llava-learning-united-visual
Affiliations
Frequent co-authors
10from 9 papers