Lei Cui
- Papers: 11
Authored papers (11)
Audio-Visual Intelligence in Large Foundation Models
arXiv 2026
UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark
arXiv 2026
Geometric-Mean Policy Optimization
arXiv 2025
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark
arXiv 2024
XDoc: Unified Pre-training for Cross-Format Document Understanding
arXiv 2022
DiT: Self-supervised Pre-training for Document Image Transformer
arXiv 2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking
arXiv 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
arXiv 2021
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding
arXiv 2021
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020 8
TableBank: A Benchmark Dataset for Table Detection and Recognition
LREC 2020 5
Frequent co-authors
10 from 11 papers