Haozhe Qi
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
arXiv 2026
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
arXiv 2026
LLaVAction: evaluating and training multi-modal large language models for action recognition
arXiv 2025
EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models
arXiv 2025
P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
p2b-point-to-box-network-for-3d-object-1
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers