Jitesh Jain

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

arXiv 2026

2026

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

arXiv 2025

2025

Slow-Fast Architecture for Video Multi-Modal Large Language Models

arXiv 2025

2025

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

arXiv 2024

2024

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

CVPR 2024 1

2023

Matting Anything

arXiv 2023

2023

OneFormer: One Transformer to Rule Universal Image Segmentation

CVPR 2023 1

2022

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Humphrey Shi

Jiachen Li

Chris Dongjoo Kim

Christopher Clark

Jieyu Zhang

Rohun Tripathi

Sangho Lee

Zixian Ma

Ali Farhadi

CEO

1 shared paper

Ali Hassani

1 shared paper