Cite
Notes
Only stored in your browser.
Attribution
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
arXiv 2025
from 1 papers
Chengfeng Zhao
Huanxuan Liao
Jun Zhao
Kang Liu
Minzheng Wang
Shizhu He
Tian Liang
Yuqiao Tan