Yi Xin
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
arXiv 2026
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
arXiv 2026
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models
arXiv 2026
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
arXiv 2026
Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
arXiv 2026
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
arXiv 2026
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
arXiv 2025
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
ICCV 2025
Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models
arXiv 2025
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
arXiv 2025
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis
arXiv 2025
From Masks to Worlds: A Hitchhiker's Guide to World Models
arXiv 2025
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation
arXiv 2025
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
ICCV 2025
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
arXiv 2024
Affiliations
Frequent co-authors
10from 16 papers