Xichen Pan
- Papers: 9
Authored papers (9)
V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising
arXiv 2026
BLIP3-o: A Family of Fully Open Unified Multimodal Models: Architecture, Training and Dataset
arXiv 2025
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
CVPR 2025 1
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop
arXiv 2025
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
arXiv 2024
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
arXiv 2023
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
ACL 2022 5
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
arXiv 2022
Affiliations
Frequent co-authors
10 from 9 papers