Cite
Notes
Only stored in your browser.
Attribution
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios
arXiv 2025
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
from 3 papers
Bohan Zeng
Xinlong Chen
Yang Shi
Yuanxing Zhang
Chaoyou Fu
Haotian Wang
Huanqian Wang
Pengfei Wan
Wenting Liu
Yi-Fan Zhang