Qintong Zhang

Papers: 7

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

PEARL: Personalized Streaming Video Understanding Model

arXiv 2026

2026

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

arXiv 2025

2025

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

arXiv 2025

2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

arXiv 2025

2025

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

arXiv 2025

2025

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM

arXiv 2025

2025

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

ICCV 2025

2024

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Conghui He

Junyuan Zhang

Bin Wang

Zichen Wen

Wentao Zhang

Ka-Ho Chow

Linfeng Zhang

Linke Ouyang

Weijia Li

Fan Wu