Sangho Lee
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
arXiv 2026
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
arXiv 2026
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
arXiv 2025
MolmoAct: Action Reasoning Models that can Reason in Space
arXiv 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025 1
One Diffusion to Generate Them All
CVPR 2025 1
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers