Chenming Zhu
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation
arXiv 2025
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
arXiv 2025
StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling
arXiv 2025
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
arXiv 2025
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
arXiv 2025
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers