Anwen Hu
7 papers
Authored papers
mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding
arXiv 2024
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models
arXiv 2024
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
CVPR 2024
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks
arXiv 2023
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
arXiv 2023
Movie101: A New Movie Understanding Benchmark
arXiv 2023
MPMQA: Multimodal Question Answering on Product Manuals
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers