Ruibin Yuan
- Papers: 23
Authored papers (23)
CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction
arXiv 2026
A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression
arXiv 2026
Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing
arXiv 2026
YuE: Scaling Open Foundation Models for Long-Form Music Generation
arXiv 2025
Kimi-Audio Technical Report
arXiv 2025
SongEval: A Benchmark Dataset for Song Aesthetics Evaluation
arXiv 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
arXiv 2025
AutoMV: An Automatic Multi-Agent System for Music Video Generation
arXiv 2025
Audio-FLAN: A Preliminary Release
arXiv 2025
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
arXiv 2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
arXiv 2025
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages
arXiv 2025
OmniBench: Towards The Future of Universal Omni-Language Models
arXiv 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
arXiv 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025
Foundation Models for Music: A Survey
arXiv 2024
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
arXiv 2024
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
arXiv 2024
You Know What I'm Saying: Jailbreak Attack via Implicit Reference
arXiv 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
Chinese Open Instruction Generalist: A Preliminary Release
arXiv 2023
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
arXiv 2023
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
arXiv 2023
Frequent co-authors
10 from 23 papers