Attribution
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
arXiv 2026
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators
arXiv 2025
from 2 papers
Donglin Wang
Pengxiang Ding
Dongyuan Zang
Han Zhao
Hengtao Li
Jinkui Shi
Kexian Yu
Minghui Lin
Mingyang Sun
Runze Suo