Yunzhi Zhuge
- Papers: 8
Authored papers (8)
VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?
arXiv 2026
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
arXiv 2025
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
arXiv 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
CVPR 2025 1
StableIdentity: Inserting Anybody into Anywhere at First Sight
arXiv 2024
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
arXiv 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
arXiv 2024
CTVIS: Consistent Training for Online Video Instance Segmentation
ICCV 2023 1
Frequent co-authors
10 from 8 papers