Jiahao Pan
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9YuE: Scaling Open Foundation Models for Long-Form Music Generation
arXiv 2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
arXiv 2025
ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
arXiv 2025
Audio-FLAN: A Preliminary Release
arXiv 2025
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
arXiv 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025 1
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
arXiv 2024
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
arXiv 2024
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers