Attribution
Shallow-π: Knowledge Distillation for Flow-based VLAs
arXiv 2026
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
from 2 papers
Boseong Jeon
Jeonghoon Shim
Jongwon Lim
Minjae Oh
Taehan Kim
Woojin Ahn
Yohan Jo