Runsen Xu
- Papers: 10
Authored papers
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
arXiv 2025
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
arXiv 2025
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
arXiv 2025
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding
arXiv 2025
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
arXiv 2024
Grounded 3D-LLM with Referent Tokens
arXiv 2024
PointLLM: Empowering Large Language Models to Understand Point Clouds
arXiv 2023
Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
arXiv 2023
Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
CVPR 2023