Attribution
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning
arXiv 2024
ParGo: Bridging Vision-Language with Partial and Global Views
from 2 papers
Bin Shan
Can Huang
Jingqun Tang
An-Lan Wang
Biao Yang
Binghong Wu
Chunhui Lin
Hao Feng
Hao Liu
Hao Lu