Zechen Bai

Papers: 10

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance

arXiv 2026

2026

EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

arXiv 2025

2025

Impossible Videos

arXiv 2025

2025

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

arXiv 2024

2024

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

CVPR 2025 1

2024

Hallucination of Multimodal Large Language Models: A Survey

arXiv 2024

2024

LOVA3: Learning to Visual Question Answering, Asking and Assessment

arXiv 2024

2024

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

arXiv 2024

2024

Object-Centric Multiple Object Tracking

ICCV 2023 1

2023

Unsupervised Open-Vocabulary Object Localization in Videos

ICCV 2023 1

2023

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Mike Zheng Shou

Tong He

Zheng Zhang

Tianjun Xiao

Bernt Schiele

Carl-Johann Simon-Gabriel

Difei Gao

Dominik Zietlow

Francesco Locatello

Kevin Qinghong Lin