Cite
Notes
Only stored in your browser.
Attribution
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
arXiv 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025 1
from 2 papers
Ajay Patel
Aniruddha Kembhavi
Chris Callison-Burch
Christopher Clark
Luca Weihs
Mark Yatskar
Matt Deitke
Ranjay Krishna
Tanmay Gupta
Yue Yang