Weihan Wang
- Papers: 10
Authored papers
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
arXiv 2026
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
arXiv 2025
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
arXiv 2024
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation
CVPR 2023
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024