Cite
Notes
Only stored in your browser.
Attribution
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs
arXiv 2025
from 1 papers
Bin Hu
Cai Chen
Deng Zhao
Ding Liu
dingnan jin
Feng Zhu
Hao Dai
Hongzhi Luan
Jia Guo
Jiaming Liu