Yi Yuan
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8Efficient Document Parsing via Parallel Token Prediction
arXiv 2026
Ming-Omni: A Unified Multimodal Model for Perception and Generation
arXiv 2025
HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMs
arXiv 2025
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning
arXiv 2025
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
arXiv 2023
Separate Anything You Describe
arXiv 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
arXiv 2023
WavJourney: Compositional Audio Creation with Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers