Attribution
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
arXiv 2025
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
arXiv 2024
from 2 papers
Aimin Zhou
Ali Vosoughi
Bo Jiang
Chao Huang
Chenliang Xu
Daiki Shimada
Hang Hua
Ji Wu
Jie Zhou
Jiebo Luo