Wentao Liu
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
arXiv 2025
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
ICCV 2025
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
arXiv 2024
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
arXiv 2024
F-LMM: Grounding Frozen Large Multimodal Models
CVPR 2025 1
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
arXiv 2024
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception
arXiv 2023
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
arXiv 2023
CLIM: Contrastive Language-Image Mosaic for Region Representation
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers