Attribution
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
arXiv 2025
Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation
from 2 papers
Bo Zheng
Jiaheng Liu
Weixun Wang
Wenbo Su
Xingyao Zhang
Yijia Luo
Dakai An
Feilei Du
Haizhou Zhao
Huimin Yi