Tianyuan Qu
- Papers: 5
Authored papers
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
arXiv 2026
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
arXiv 2025
RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing
arXiv 2025
Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?
ICCV 2025
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
ICCV 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers