We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning. Seed1.5-VL is composed of a 532M-parameter vision encoder and a Mixture-of-Experts (MoE) LLM with 20B active parameters. Despite its relatively compact architecture, it delivers strong performance across a wide spectrum of public VLM benchmarks and internal evaluation suites, achieving state-of-the-art performance on 38 out of 60 public benchmarks. Moreover, in agent-centric tasks such as GUI control and gameplay, Seed1.5-VL outperforms leading multimodal systems, including OpenAI CUA and Claude 3.7. Beyond visual and video understanding, it also demonstrates strong reasoning abilities, making it particularly effective for multimodal reasoning challenges such as visual puzzles. We believe these capabilities will empower broader applications across diverse tasks. In this report, we provide a comprehensive review of our experience in building Seed1.5-VL across model design, data construction, and training at various stages, in the hope that it can inspire further research. Seed1.5-VL is now accessible at https://www.volcengine.com/ (Volcano Engine Model ID: doubao-1-5-thinking-vision-pro-250428).
Seed1.5-VL Technical Report
We present Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning.
- Year: 2025
- Venue: arXiv 2025
- Authors: 197
Attribution
- Abstract & full text: arxiv.org/abs/2505.07062
- TL;DR: Semantic Scholar
Authors
Heng Ji, Yujia Qin, Haibin Lin, Xin Liu, Can Huang, Ke Wang, Yuntao Li, Zehua Wang, Yu Liu, Kai Liu, Jiahao Li, Yanghua Peng, Mengfei Du, Xinyu Yang, PengFei Liu, Rui Yang, Jiawei Wang, Kai Hua, Wenhao Huang, Jingqun Tang, Weihao Yu, Zilong Huang, Jiashi Feng, Rui Wang, Dong Guo, Renrui Zhang, Feng Li, Yanwei Li, Chunyuan Li, Rui Qian, Lin Chen, Jialong Wu, Haoqi Fan, Lei LI, Lei Shi, Kunchang Li, ZiHao Wang, Bencheng Liao, Xuefeng Xiao, Zhiyong Wu, Tianheng Cheng, Kai Shen, Liang Xiang, Cheng Lin, Jian Wang, Mingxuan Wang, Jiaze Chen, Yonghui Wu, Xiang Long, Xinchen Zhang, Ke Shen, Junting Lu, Yifan Du, Zhipeng Chen, Joya Chen, Yining Ye, Shihao Liang, Zeyu Wang, Shen Yan, Xuehan Xiong, Hui Shen, Faming Wu, Feida Zhu, Fuxing Leng, Guang Shi, Haobin Chen, Jianyu Jiang, Jingji Chen, Jingjia Huang, Kang Lei, Liping Yuan, Lishu Luo, Qinghao Ye, Shixiong Zhao, Shuai Peng, Shuangye Li, Sihang Yuan, Sijin Wu, Weiwei Liu, Wenqian Wang, Xianhan Zeng, Xiao Liu, Xiaobo Qin, Xiaohan Ding, Xiaojun Xiao, Xiaoying Zhang, Xuanwei Zhang, Yangrui Chen, Yanxu Hu, Yi Lin, Yiyuan Hu, Yiyuan Zhang, Youbin Wu, Yu Li, Yudong Liu, Yue Ling, Zanbo Wang, Zhiwu He, Aoxue Zhang, Bairen Yi, Can Zhang, Chaorui Deng, Chaoyi Deng, Cheng Yuan, Chenggang Li, Chenhui Gou, Chenwei Lou, Chengzhi Wei, Chundian Liu, Deyao Zhu, Donghong Zhong, Feng Zhang, Gang Wu, Guodong Li, Guohong Xiao, Haihua Yang, Haoming Wang, Hongxiang Hao, Huixia Li, Jianhua Zhu, Jianpeng Jiao, Jianhui Duan, Jihao Liu, Jin Zeng, Jingyu Sun, Jun Long, Junda Feng, Junfeng Zhan, Junjie Fang, Kaiyuan Zhang, Keyu Pan, Kun Zhang, Lanxin Li, Li Han, Liangqiang Chen, Lin Li, Lin Yan, Liying Chi, Longxiang Liu, Ningxin Pan, Peibin Chen, Pengfei Chen, Pengfei Wu, Qingqing Yuan, Qingyao Shuai, Qiuyan Tao, Renjie Zheng, Ru Zhang, Rui Zhao, Shaoqiang Xu, Shipeng Yan, Shu Zhong, Shuaishuai Cao, Shuangzhi Wu, Shufan Liu, Shuhan Chang, Songhua Cai, Tenglong Ao, Tianhao Yang, Tingting Zhang, Wanjun Zhong, Wei Jia, Wei Weng, Wenjia Zhu, Wenli Yang, Wenzhi Wang, XiangRui Yin, Xiao Li, Xiaolei Zhu, Xiaoying Jia, Xijin Zhang, Xiongcai Luo, Xiuli Chen, Xuantong Zhong, Xujing Li, Yan Wu, Yawei Wen, Yihao Zhang, Yu Yue, Yufeng Zhou, Yufeng Yuan, Yuhang Xu, Yuhong Yang, Yun Zhang, Yunhao Fang, Yurui Ren, Yuwen Xiong, Zehua Hong, Zewei Sun, Zhao Cai, Zhaoyue Zha, Zhecheng An, Zhehui Zhao, Zhengzhuo Xu, Zhuofan Zheng, Ziyu Zhu, Zuquan Song