Yiwu Zhong

Papers: 9

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

PAVE: Patching and Adapting Video Large Language Models

CVPR 2025 1

2025

Rethinking Chain-of-Thought Reasoning for Videos

arXiv 2025

2025

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning

ICCV 2025

2024

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

arXiv 2024

2024

Towards Learning a Generalist Model for Embodied Navigation

CVPR 2024 1

2023

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

arXiv 2023

2023

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations

CVPR 2023 1

2023

Learning Concise and Descriptive Attributes for Visual Recognition

ICCV 2023 1

2023

RegionCLIP: Region-based Language-Image Pretraining

CVPR 2022 1

2021

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Yin Li

Jianfeng Gao

Jianwei Yang

LiWei Wang

An Yan

Julian McAuley

Zhuoming Liu

Bocheng Zou

Chengyu Dong

Chunyuan Li