Zhuochen Wang

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

SAMTok: Representing Any Mask with Two Words

arXiv 2026

2026

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

arXiv 2025

2025

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

arXiv 2025

2025

PairUni: Pairwise Training for Unified Multimodal Language Models

arXiv 2025

2025

HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding

arXiv 2025

2025

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

arXiv 2025

2025

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

arXiv 2025

2025

From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Anran Wang

Haochen Wang

Xiangtai Li

Tao Zhang

Ye Tian

Jiacong Wang

Jiani Zheng

Yunhai Tong

Zhiyang Teng

Fengmao Lv