Attribution
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025 1
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
CVPR 2024 1
from 3 papers
Pei Fu
Qianyi Jiang
Junfeng Luo
Shan Guo
Tongkun Guan
Wei Shen
Xiaokang Yang
Zining Wang
Hao Sun
Kai Zhou