Cha Zhang
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding
arXiv 2025
XDoc: Unified Pre-training for Cross-Format Document Understanding
arXiv 2022
DiT: Self-supervised Pre-training for Document Image Transformer
arXiv 2022
Unifying Vision, Text, and Layout for Universal Document Processing
CVPR 2023 1
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
arXiv 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
arXiv 2021
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers