Zekun Qi
- Papers: 13
Authored papers (13)
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
arXiv 2026
Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
arXiv 2026
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
arXiv 2025
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
arXiv 2025
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
arXiv 2025
Reasoning in Space via Grounding in the World
arXiv 2025
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
arXiv 2024
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast
arXiv 2023
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation
DreamLLM: Synergistic Multimodal Comprehension and Creation
arXiv 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
arXiv 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
arXiv 2022
Frequent co-authors
10 from 13 papers