Haozhe Qi

Cite

Notes

Only stored in your browser.

Attribution

5papers

Authored papers

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

arXiv 2026

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

arXiv 2026

LLaVAction: evaluating and training multi-modal large language models for action recognition

arXiv 2025

EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models

arXiv 2025

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

p2b-point-to-box-network-for-3d-object-1

No known affiliations.

from 5 papers

Alexander Mathis

Mahdi Rad

Marc Pollefeys

Kevin Qu

Rui Wang

Andy Bonnetto

Chen Feng

Feng Zhao

Franklin Leong

Friedhelm Hummel