We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10times-20times larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.
STEP3-VL-10B Technical Report
We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence.
- Year
- 2026
- Venue
- arXiv 2026
- Authors
- 93
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2601.09668ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
93Daxin JiangLiangyu ChenJun ZhangAng LiJie zhouXin HuangLiang ZhaoYang LiChunrui HanGuopeng LiYuang PengQuan SunJingwei WuZheng GeYibo ZhuXiangyu ZhangWenwen QuFanqi WanZhewei HuangHongYu ZhouJia WangYanming XuJianjian SunDi QiJingcheng HuYinmin ZhangQi HanZhe XieAilin HuangBo DongYu ZhouZhenyi LuJie ChengHan ZhouWei JiHangyu GuoXinran WangHanshan ZhangSiyuan ZhangEn YuKangheng LinYana WeiAobo KongChangyi WanChengyuan YaoHaoran LvHebin ZhouHongbo PengJian ZhouJiaran ZhangJiashu LvJiayi FuJingJing XieJunfeng LiuKaijun TanKaiwen YanLina ChenMingliang LiMitt HuangQian ZhaoShaoliang PangShengjie FanShijie ShangSong YuanWuxun XieXiangfeng WangXiangwen KongXiaobo YangXiaoran JiaoXiaoxiao RenXin WuXing ChenYanlin LaiYeqing ShenYingxiu ZhaoYue PengYusheng LiYuxiang YangYuyang ChenZejia WengYukang ShiDingming LiZiyang MengTianhao YouXuelin ZhangJisheng YinHaolong YanZihui ChengDavid WangHaiquan YinXiaojie HouYuyang ZhangZhimin Fan