Cite
Notes
Only stored in your browser.
Attribution
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
arXiv 2025
from 1 papers
Baolin Peng
Hao Cheng
Jianfeng Gao
Kuan Wang
Liliang Ren
Qing Yang
Shuohang Wang
Simon Shaolei Du
Weizhu Chen
Xuehai He