Jiahao Pan

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

YuE: Scaling Open Foundation Models for Long-Form Music Generation

arXiv 2025

2025

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

arXiv 2025

2025

Audio-FLAN: An Instruction-Following Dataset for Unified Audio Understanding and Generation of Speech, Music, and Sound

arXiv 2025

2025

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

arXiv 2025

2025

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

arXiv 2024

2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

arXiv 2024

2024

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

arXiv 2024

2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

CVPR 2025 1

2024

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Yike Guo

Wei Xue

Ruibin Yuan

Zeyue Tian

Emmanouil Benetos

Ge Zhang

researcher

Yinghao Ma

Chenghua Lin

Liumeng Xue

Qifeng Liu