Wenxuan Song
- Papers: 8
Authored papers
FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
arXiv 2026
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
arXiv 2026
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
arXiv 2025
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
arXiv 2025
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
arXiv 2025
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
arXiv 2025
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process
arXiv 2025
MapSAM: Adapting Segment Anything Model for Automated Feature Detection in Historical Maps
arXiv 2024
Frequent co-authors
10 from 8 papers