Sijie Zhu

Papers: 6


Attribution

Affiliations & profile: Semantic Scholar



Authored papers

Vidi: Large Multimodal Models for Video Understanding and Editing

arXiv 2025


SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

ICCV 2025


Where do Large Vision-Language Models Look at when Answering Questions?

arXiv 2025


CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

arXiv 2024


Multi-Reward as Condition for Instruction-based Image Editing

arXiv 2024


Visual Explanation for Deep Metric Learning

arXiv 2019


Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Fan Chen

Longyin Wen

Chia-Wen Kuo

Ming Li

Xin Gu

Chen Chen

Xiaoying Xing

Celong Liu

Dawei Du

Guang Chen