Xiaofan Li
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
arXiv 2026
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models
arXiv 2026
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
arXiv 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
arXiv 2025
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
arXiv 2025
Artemis: Structured Visual Reasoning for Perception Policy Learning
arXiv 2025
Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception
arXiv 2024
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
arXiv 2023
Affiliations
Frequent co-authors
10from 8 papers