Qiuqiang Kong
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis
arXiv 2025
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
arXiv 2024
Foundation Models for Music: A Survey
arXiv 2024
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
arXiv 2023
Separate Anything You Describe
arXiv 2023
Universal Source Separation with Weakly Labelled Data
arXiv 2023
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
arXiv 2023
MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning
arXiv 2023
WavJourney: Compositional Audio Creation with Large Language Models
arXiv 2023
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
arXiv 2022
Neural Vocoder is All You Need for Speech Super-resolution
arXiv 2022
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation
arXiv 2021
VoiceFixer: Toward General Speech Restoration with Neural Vocoder
arXiv 2021
Affiliations
Frequent co-authors
10from 13 papers