Boshen Xu
6 papers
Authored papers
Xiaomi MiMo-VL-Miloco Technical Report
arXiv 2025
TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding
arXiv 2025
TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM
arXiv 2025
SPAFormer: Sequential 3D Part Assembly with Transformers
arXiv 2024
POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-View World
arXiv 2024
EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers