Xinyuan Chen
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling
arXiv 2025
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
arXiv 2025
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
arXiv 2024
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
CVPR 2025 1
Latte: Latent Diffusion Transformer for Video Generation
arXiv 2024
Vlogger: Make Your Dream A Vlog
CVPR 2024 1
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
arXiv 2024
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
arXiv 2023
Long-Term Rhythmic Video Soundtracker
arXiv 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
arXiv 2023
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
CVPR 2024 1
Diff-Font: Diffusion Model for Robust One-Shot Font Generation
arXiv 2022
Cross Attention Based Style Distribution for Controllable Person Image Synthesis
arXiv 2022
Affiliations
Frequent co-authors
10from 13 papers