Yingya Zhang
- Papers: 14
Authored papers (14)
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
arXiv 2025
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
arXiv 2025
TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation
arXiv 2025
Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance
arXiv 2025
Wan: Open and Advanced Large-Scale Video Generative Models
arXiv 2025
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
ICCV 2025
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
arXiv 2024
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
arXiv 2023
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models
arXiv 2023
ModelScope Text-to-Video Technical Report
arXiv 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
arXiv 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
CVPR 2024 1
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
ICCV 2023 1
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
ICCV 2023 1
Frequent co-authors
10 from 14 papers