Xianhang Li
7 papers
Authored papers
OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
ICCV 2025
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning
arXiv 2025
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
arXiv 2024
What If We Recaption Billions of Web Images with LLaMA-3?
arXiv 2024
Autoregressive Pretraining with Mamba in Vision
arXiv 2024
CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a $10,000 Budget; An Extra $4,000 Unlocks 81.8% Accuracy
arXiv 2023
Unleashing the Power of Visual Prompting At the Pixel Level
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers