Yonghui Wang
- Papers: 4
Authored papers
ROOT: VLM based System for Indoor Scene Understanding and Beyond
arXiv 2024
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding
arXiv 2024
TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding
arXiv 2024
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers