We introduce LongCat-Flash-Thinking-2601, a 560-billion-parameter open-source Mixture-of-Experts (MoE) reasoning model with superior agentic reasoning capability. LongCat-Flash-Thinking-2601 achieves state-of-the-art performance among open-source models on a wide range of agentic benchmarks, including agentic search, agentic tool use, and tool-integrated reasoning. Beyond benchmark performance, the model demonstrates strong generalization to complex tool interactions and robust behavior under noisy real-world environments. Its advanced capability stems from a unified training framework that combines domain-parallel expert training with subsequent fusion, together with an end-to-end co-design of data construction, environments, algorithms, and infrastructure spanning from pre-training to post-training. In particular, the model's strong generalization capability in complex tool-use are driven by our in-depth exploration of environment scaling and principled task construction. To optimize long-tailed, skewed generation and multi-turn agentic interactions, and to enable stable training across over 10,000 environments spanning more than 20 domains, we systematically extend our asynchronous reinforcement learning framework, DORA, for stable and efficient large-scale multi-environment training. Furthermore, recognizing that real-world tasks are inherently noisy, we conduct a systematic analysis and decomposition of real-world noise patterns, and design targeted training procedures to explicitly incorporate such imperfections into the training process, resulting in improved robustness for real-world applications. To further enhance performance on complex reasoning tasks, we introduce a Heavy Thinking mode that enables effective test-time scaling by jointly expanding reasoning depth and width through intensive parallel thinking.
LongCat-Flash-Thinking-2601 Technical Report
A 560-billion-parameter Mixture-of-Experts reasoning model achieves state-of-the-art performance on agentic benchmarks through a unified training framework combining domain-parallel expert training with fusion, along with enhancements for real-world robustness and complex reasoning.
- Year
- 2026
- Venue
- arXiv 2026
- Authors
- 161
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2601.16725ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
161Fan WuYang LiuXin ChenChao ZhangYihao ChenZhijian LiuJian YangYulei QianYuchen XieZiwen WangXuezhi CaoXunliang CaiLinsen GuoJianing WangDengchang ZhaoJinrui DingXiandi MaPeng PeiShuang ZhouWei ShiWei WangPeng ZhaoHao YangWei LiuYuxin ChenZhao YangYifan LuChengcheng HanZhiyuan YaoZhuowen HanQi GuWentao ChenZhengyu ChenShuo WangChen ZhangXiaoyu LiXiao LiuYang YangTao LiangXiaolong ChenYi ZhangHaibin WangHongyan HaoXingchen LiuChen GaoRongzhi ZhangZijian ZhangDongyu RuLin QiuYaorui ShiYuxin LiuBei LiJingang WangHui SuYang BaiHongyu ZangYan ChenJinluan YangYaqi HuoXi SuJiapeng ZhuJiaMing WangShengnan AnHaoxiang MaJiacheng ZhangWentao ShiMeituan LongCat TeamAnchun GuiBingyang TaoBole ZhouBorun ChenChenhui YangChuyu ZhangCong ChenCunguang WangDaoru PanDefei BuDi XiuDishan LiuDunwei TuFengcheng YuanFengcun LiGang XuGuanyu WuGuoyuan LinHansi YangHaonan YanHaoxing WenHongyin TangHongzhi NiJiahong ZhouJiahuan LiJianfei ZhangJianhao XuJiaqi SunJiarong ShiJiarui ZhaoJinwei XiaoJiyuan HeJuncan XuKefeng ZhangKeheng WangLi WeiLianhui MaLingbing KongLingchuan LiuMengshen ZhuMengxia ShenMingyang ZhuPeiguang LiPengcheng JiaPengtao ZhangQiong HuangQiyuan DuanQuanchi WengRongxiang WengRumei LiShanglin LeiShijun DaiShuaikang LiuSongyuan ZhaoTianhao HuTianze ChenWeifeng TangWenjie ShiWenlong ZhuXiangcheng LiuXiangyu XiXiangyuan LiuXiangzhou HuangXiaodong CaiXiaowei ShiXuan HuangYang ZhengYaoming WangYaoming ZhuYanyu ChenYerui SunYi-Kai ZhangYifan ZhaoYitao ZhaiYongjing YinYongwei ZhouYoushao XiaoYuchuan DaiYuchen YuYufei ZhangYuhuai WeiYunfan LiangYunke ZhaoYuwei JiangYuxin BianYue XuYueqing SunZeyang YuZhengsheng HuangZhikang XiaZhimin LinZhuofan ChenZiran LiZiyuan Zhuang