Wentao Liu

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

arXiv 2025

2025

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

ICCV 2025

2025

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

arXiv 2024

2024

F-LMM: Grounding Frozen Large Multimodal Models

CVPR 2025 1

2024

CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

arXiv 2024

2024

CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models

arXiv 2024

2024

CLIM: Contrastive Language-Image Mosaic for Region Representation

arXiv 2023

2023

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

arXiv 2023

2023

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Sheng Jin

Chen Change Loy

Lumin Xu

Size Wu

Wenwei Zhang

Chen Qian

Ping Luo

Wei Li

Aimin Zhou

Aohan Zeng