Attribution
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
arXiv 2025
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
arXiv 2024
from 2 papers
Yu Qiao
Baoqi Pei
Bin Wang
Bo Zhang
Bowen Zhou
professor
Boyu Niu
Chang Yuan
Chao Xu
Conghui He
Dahua Lin