Mengqi Huang
- Papers: 15
Authored papers (15)
Lance: Unified Multimodal Modeling by Multi-Task Synergy
arXiv 2026
Stream-T1: Test-Time Scaling for Streaming Video Generation
arXiv 2026
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
arXiv 2026
NativeTok: Native Visual Tokenization for Improved Image Generation
arXiv 2026
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
ICCV 2025
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
arXiv 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
ICCV 2025
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
arXiv 2025
UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
arXiv 2025
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation
arXiv 2025
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
ICCV 2025
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
arXiv 2024
RealCustom++: Representing Images as Real-Word for Real-Time Customization
arXiv 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
CVPR 2024
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Affiliations
Frequent co-authors
10 from 15 papers