Yatian Pang
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
arXiv 2025
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
arXiv 2025
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
arXiv 2024
Open-Sora Plan: Open-Source Large Video Generation Model
arXiv 2024
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation
arXiv 2024
Next Patch Prediction for Autoregressive Visual Generation
arXiv 2024
Envision3D: One Image to 3D with Anchor Views Interpolation
arXiv 2024
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
arXiv 2023
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
arXiv 2023
Masked Autoencoders for Point Cloud Self-supervised Learning
arXiv 2022
Affiliations
Frequent co-authors
10from 10 papers