We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability to perceive, interpret, and act over heterogeneous contexts such as images, videos, webpages, documents, GUIs. GLM-5V-Turbo is built around this objective: multimodal perception is integrated as a core component of reasoning, planning, tool use, and execution, rather than as an auxiliary interface to a language model. This report summarizes the main improvements behind GLM-5V-Turbo across model design, multimodal training, reinforcement learning, toolchain expansion, and integration with agent frameworks. These developments lead to strong performance in multimodal coding, visual tool use, and framework-based agentic tasks, while preserving competitive text-only coding capability. More importantly, our development process offers practical insights for building multimodal agents, highlighting the central role of multimodal perception, hierarchical optimization, and reliable end-to-end verification.
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents.
- Year
- 2026
- Venue
- arXiv 2026
- Authors
- 78
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2604.26752ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
78Jie TangYu WangWei LiQinkai ZhengZhenyu HouMingdao LiuZihan WangGuo WangHaoran WangWeihan WangXiaotao GuZhen YangBin XuMinlie HuangJuanzi LiYuxiao DongXinyu ZhangLicheng BaoFan YangYue WangPeng ZhangYukuo CenXinyu LiuYan WangJiale ZhuDan ZhangCong YaoYuting WangJiaxing XuWenkai LiJiale ChengZehui WangJiaxin FanJing ChenZiHao ZhouV TeamWenyi HongZiyang PanYuanchang YueYanling WangXijun LiuWenmeng YuShuaiqi DuanSheng YangRuiliang LvLihang PanKe NingJunhui JiJinjiang WangJiazheng XuJi QiGuobing GanZijun DouZhiqi GeZhijie LiZhao XueZehai HeYusen LiuYuchen LiYuan WangYijian LuYanzi WangYadong XueTianyu TongTianshu ZhangShengdong YanMingde XuJiawen QianJiali ChenJiahui LinHaozhi ZhengHaochen LiChuangxin ZhaoChengcheng WuBoyan ShiBowei JiaBaoxu WangDebing Liu