Yin Xie
6 papers
Authored papers
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
arXiv 2026
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
arXiv 2026
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
arXiv 2025
Region-based Cluster Discrimination for Visual Representation Learning
ICCV 2025
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training
arXiv 2025
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers