Cite
Notes
Only stored in your browser.
Attribution
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs
arXiv 2025
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
arXiv 2024
from 2 papers
King Zhu
Bo Li
Boyu Feng
David Ma
Ge Zhang
researcher
Graham Neubig
professor
Jiaheng Liu
Jincheng Ren
Jun Ma
Meng Cao