Jianzong Wu
- Papers: 9
Authored papers
Towards Customized Multimodal Role-Play
arXiv 2026
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model
arXiv 2025
VMoBA: Mixture-of-Block Attention for Video Diffusion Models
arXiv 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
ICCV 2025
An Empirical Study of GPT-4o Image Generation Capabilities
arXiv 2025
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation
arXiv 2025
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
CVPR 2025
RelationBooth: Towards Relation-Aware Customized Object Generation
arXiv 2024
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
ICCV 2023
Frequent co-authors
10 from 9 papers