Zechen Bai
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
arXiv 2026
EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models
arXiv 2025
Impossible Videos
arXiv 2025
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
arXiv 2024
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
CVPR 2025 1
Hallucination of Multimodal Large Language Models: A Survey
arXiv 2024
LOVA3: Learning to Visual Question Answering, Asking and Assessment
arXiv 2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
arXiv 2024
Unsupervised Open-Vocabulary Object Localization in Videos
ICCV 2023 1
Object-Centric Multiple Object Tracking
ICCV 2023 1
Affiliations
Frequent co-authors
10from 10 papers