0

Ruibin Yuan

Papers
23

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
23papers

Authored papers

23

CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction

arXiv 2026

2026

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

arXiv 2026

2026

Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

arXiv 2026

2026

YuE: Scaling Open Foundation Models for Long-Form Music Generation

arXiv 2025

2025

Kimi-Audio Technical Report

arXiv 2025

2025

SongEval: A Benchmark Dataset for Song Aesthetics Evaluation

arXiv 2025

2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

arXiv 2025

2025

AutoMV: An Automatic Multi-Agent System for Music Video Generation

arXiv 2025

2025

Audio-FLAN: A Preliminary Release

arXiv 2025

2025

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

arXiv 2025

2025

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

arXiv 2025

2025

CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages

arXiv 2025

2025

OmniBench: Towards The Future of Universal Omni-Language Models

arXiv 2024

2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM

arXiv 2024

2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

CVPR 2025 1

2024

Foundation Models for Music: A Survey

arXiv 2024

2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

arXiv 2024

2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

arXiv 2024

2024

You Know What I'm Saying: Jailbreak Attack via Implicit Reference

arXiv 2024

2024

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

CVPR 2024 1

2023

Chinese Open Instruction Generalist: A Preliminary Release

arXiv 2023

2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

arXiv 2023

2023

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 23 papers