Yuan Zhou
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars
arXiv 2026
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining
arXiv 2026
Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization
arXiv 2026
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters
arXiv 2025
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
arXiv 2025
HunyuanVideo 1.5 Technical Report
arXiv 2025
Video Generation Models Are Good Latent Reward Models
arXiv 2025
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy
arXiv 2025
On Path to Multimodal Generalist: General-Level and General-Bench
arXiv 2025
Allegro: Open the Black Box of Commercial-Level Video Generation Model
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers