We present Ring-1T, the first open-source, state-of-the-art thinking model with a trillion-scale parameter. It features 1 trillion total parameters and activates approximately 50 billion per token. Training such models at a trillion-parameter scale introduces unprecedented challenges, including train-inference misalignment, inefficiencies in rollout processing, and bottlenecks in the RL system. To address these, we pioneer three interconnected innovations: (1) IcePop stabilizes RL training via token-level discrepancy masking and clipping, resolving instability from training-inference mismatches; (2) C3PO++ improves resource utilization for long rollouts under a token budget by dynamically partitioning them, thereby obtaining high time efficiency; and (3) ASystem, a high-performance RL framework designed to overcome the systemic bottlenecks that impede trillion-parameter model training. Ring-1T delivers breakthrough results across critical benchmarks: 93.4 on AIME-2025, 86.72 on HMMT-2025, 2088 on CodeForces, and 55.94 on ARC-AGI-v1. Notably, it attains a silver medal-level result on the IMO-2025, underscoring its exceptional reasoning capabilities. By releasing the complete 1T parameter MoE model to the community, we provide the research community with direct access to cutting-edge reasoning capabilities. This contribution marks a significant milestone in democratizing large-scale reasoning intelligence and establishes a new baseline for open-source model performance.
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Ring-1T, a trillion-parameter open-source thinking model, addresses training challenges with IcePop, C3PO++, and ASystem, achieving top results across benchmarks and democratizing large-scale reasoning intelligence.
- Year
- 2025
- Venue
- arXiv 2025
- Authors
- 104
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2510.18855ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
104Wenbo YuChao HuangLei ChenYuchen YanChao ZhangJunbo ZhaoZhe LiJian LiuJia GuoMeng LiTianyu ZhouJun MeiTongkai YangZhenzhong LanJin YangZhengke GuiZhiqiang ZhangJun ZhouZiHao WangTao ZhangLinfeng ShiJiaming LiuCheng LinXinyu TangQi ZuoXudong WangXudong HanHong LiuBin HuXueyu HuZHIXUN LIFeng XuYuanyuan WangWeihua ChenTianyu ZhangXin ZhaoTiwei Biezehuan liZhenxuan PanTao WuLing TeamAnqi ShenBaihui LiBin JingCai ChenChaokun YangChengyao WenCongqi LiDeng ZhaoDingbo YuanDonghai YouFagui MaoFanzhuang MengGuojie LiGuowei WangHao DaiHaonan ZhengJianhao FuJiannan ShiJianwen WangJianxin LaiJunping ZhaoKuan XuLe SuLi TangLiang JiangLiangcheng FuLianhao XuLisha LiaoLongfei ZhengMingchun ChenQiang ChengQianggang CaoQitao ShiQuanrui GuoSenlin ZhuShaofei WangShaomian ZhengShuaicheng LiShuwei GuSiba ChenWang HongWang RenWengang ZhengXiangchun WangXiaodong YanXiaopei WanXinyu KongXuemin YangYalin ZhangYan SunYicheng ShanYilong WangYingying XuYongkang LiuYongzhen GuoYuefan WangYuhong GuoZhankai XuZhenduo ZhangZhenyu HuangZhiqiang DingZhizhen LiuZujie Wen