Mahdi Rad
3 papers
Authored papers
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
arXiv 2026
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
arXiv 2026
EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 3 papers