Sijie Zhu
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Vidi: Large Multimodal Models for Video Understanding and Editing
arXiv 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
ICCV 2025
Where do Large Vision-Language Models Look at when Answering Questions?
arXiv 2025
Multi-Reward as Condition for Instruction-based Image Editing
arXiv 2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
arXiv 2024
Visual Explanation for Deep Metric Learning
arXiv 2019
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers