Cite
Notes
Only stored in your browser.
Attribution
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
arXiv 2025
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
from 2 papers
Xiaoyu Tan
Chao Qu
Chaofan Qiu
Gang Li
Guocan Cai
Haojia Lin
Jason Klein Liu
Jiaran Hao
Ke Li
Lichao Chen