Tanmay Gupta
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
arXiv 2025
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
arXiv 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025 1
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
arXiv 2024
Towards General Purpose Vision Systems
arXiv 2021
Visual Semantic Role Labeling for Video Understanding
CVPR 2021 1
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers