Xichen Pan

Papers: 9

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

arXiv 2026

2026

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

arXiv 2025

2025

Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

CVPR 2025 1

2025

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

arXiv 2025

2025

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

arXiv 2024

2024

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

arXiv 2023

2023

Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

arXiv 2022

2022

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition

ACL 2022 5

2022

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition

ACL 2022 5

2022

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Saining Xie

4 shared papers

Helong Zhou

2 shared papers

Peiyu Chen

2 shared papers

Wenhu Chen

professor

Xinbing Wang

Yichen Gong

Zhouhan Lin

Adithya Iyer

Bingda Tang

BoYang Zheng