Attribution
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
arXiv 2026
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations
arXiv 2025
from 2 papers
Ranjay Krishna
Zixian Ma
Ali Farhadi (CEO)
Chris Dongjoo Kim
Christopher Clark
George Stoica
Jae Sung Park
Jianrui Zhang
Jiawei Gu
Jieyu Zhang