Wenyao Zhang
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model
arXiv 2026
Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
arXiv 2026
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
arXiv 2025
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
arXiv 2025
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
dreamvla-a-vision-language-action-model
Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation
arXiv 2025
Reasoning in Space via Grounding in the World
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers