Cite
Notes
Only stored in your browser.
Attribution
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025 1
SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models
arXiv 2024
AI2-THOR: An Interactive 3D Environment for Visual AI
arXiv 2017
from 3 papers
Aniruddha Kembhavi
Ali Farhadi
CEO
Eli VanderBilt
Kuo-Hao Zeng
Luca Weihs
Matt Deitke
Ranjay Krishna
Rose Hendrix
Aaron Sarnat
Abhinav Gupta