Muzhi Zhu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11LLaDA2.1: Speeding Up Text Diffusion via Token Editing
arXiv 2026
Exploring Spatial Intelligence from a Generative Perspective
arXiv 2026
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering
arXiv 2026
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
arXiv 2025
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks
arXiv 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
arXiv 2025
Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
arXiv 2025
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
CVPR 2024 1
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
arXiv 2024
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
arXiv 2023
SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning
ICCV 2023 1
Affiliations
Frequent co-authors
10from 11 papers