Qi Qin
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
arXiv 2026
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
arXiv 2026
Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
arXiv 2026
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
arXiv 2025
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
ICCV 2025
OmniCaptioner: One Captioner to Rule Them All
arXiv 2025
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
arXiv 2025
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis
arXiv 2025
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis
arXiv 2025
Judge Anything: MLLM as a Judge Across Any Modality
arXiv 2025
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
arXiv 2024
M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework
arXiv 2024
Affiliations
Frequent co-authors
10from 13 papers