Attribution
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios
arXiv 2025
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
from 2 papers
Chaoyou Fu
Yang Shi
Yi-Fan Zhang
Bingyan Nie
Bohan Zeng
Haotian Wang
Hongkai Chen
Huanqian Wang
Huanyao Zhang
Liang Wang