Andrew Head

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

arXiv 2025

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models

CVPR 2025 1

No known affiliations.

from 2 papers

Ajay Patel

Aniruddha Kembhavi

Chris Callison-Burch

Christopher Clark

Luca Weihs

Mark Yatskar

Matt Deitke

Ranjay Krishna

Tanmay Gupta

Yue Yang