Shiwei Zhang
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation
arXiv 2026
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
arXiv 2025
TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation
arXiv 2025
ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
arXiv 2025
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
arXiv 2025
Wan: Open and Advanced Large-Scale Video Generative Models
arXiv 2025
Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance
arXiv 2025
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
ICCV 2025
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
arXiv 2024
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models
arXiv 2024
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
arXiv 2023
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models
arXiv 2023
ModelScope Text-to-Video Technical Report
arXiv 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
ICCV 2023 1
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
CVPR 2024 1
RLIPv2: Fast Scaling of Relational Language-Image Pre-training
ICCV 2023 1
Affiliations
Frequent co-authors
10from 16 papers