Zhuoyi Yang
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Concat-ID: Towards Universal Identity-Preserving Video Synthesis
arXiv 2025
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
arXiv 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
relay-diffusion-unifying-diffusion-process
GLM-130B: An Open Bilingual Pre-trained Model
arXiv 2022
CogView: Mastering Text-to-Image Generation via Transformers
NeurIPS 2021 12
Affiliations
Frequent co-authors
10from 10 papers