Cite
Notes
Only stored in your browser.
Attribution
The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
arXiv 2025
from 1 papers
Chao Qu
Jason Klein Liu
Long Li
Shirui Pan
Wei Chu
Xiaoyu Tan
Yuan Qi
Zhe Wang
Zhijian Zhou