We present MiMo-7B, a large language model born for reasoning tasks, with optimization across both pre-training and post-training stages. During pre-training, we enhance the data preprocessing pipeline and employ a three-stage data mixing strategy to strengthen the base model's reasoning potential. MiMo-7B-Base is pre-trained on 25 trillion tokens, with additional Multi-Token Prediction objective for enhanced performance and accelerated inference speed. During post-training, we curate a dataset of 130K verifiable mathematics and programming problems for reinforcement learning, integrating a test-difficulty-driven code-reward scheme to alleviate sparse-reward issues and employing strategic data resampling to stabilize training. Extensive evaluations show that MiMo-7B-Base possesses exceptional reasoning potential, outperforming even much larger 32B models. The final RL-tuned model, MiMo-7B-RL, achieves superior performance on mathematics, code and general reasoning tasks, surpassing the performance of OpenAI o1-mini. The model checkpoints are available at https://github.com/xiaomimimo/MiMo.
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
We present MiMo-7B, a large language model born for reasoning tasks, with optimization across both pre-training and post-training stages.
- Year
- 2025
- Venue
- arXiv 2025
- Authors
- 64
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2505.07608ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
64Dong ZhangPeng WangShuo LiuGang WangLiang ZhaoYi HuangDi ZhangXiaomi LLM-Core TeamBingquan XiaBowen ShenCiciDawei ZhuHailin ZhangHuaqiu LiuJiebao XiaoJinhao DongPeidian LiShihua YuShimao ChenWeikun WangWenhan MaXiangwei DengYiFan SongZihan JiangBowen YeCan CaiChenhong HeDuo ZhangGuoan WangHao TianHaochen ZhaoHeng QuHongshen XuJun ShiKainan BaoQingkai FangKang ZhouKangyang ZhouLei LIMenghang ZhuNuo ChenQiantong WangShaohui LiuShicheng LiShuhao GuShuhuai RenSirui DengWeiji ZhuangWeiwei LvWenyu YangXin ZhangXing YongXing ZhangXingchen SongXinzhe XuXu WangYihan YanYu TuYuanyuan TianYudong WangYue YuZhenru LinZhichao SongZihao Yue