Cite
Notes
Only stored in your browser.
Attribution
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
CVPR 2025 1
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
arXiv 2024
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
from 3 papers
Arman Cohan
Chuhan Li
Yilun Zhao
Yixin Liu
Chen Zhao
Chengye Wang
Deyuan Li
Guo Gan
Haowei Zhang
Junyang Song