Qiuqiang Kong

Papers: 13

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

arXiv 2025

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

arXiv 2024

Foundation Models for Music: A Survey

arXiv 2024

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

arXiv 2023

Separate Anything You Describe

arXiv 2023

WavJourney: Compositional Audio Creation with Large Language Models

arXiv 2023

MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning

arXiv 2023

Universal Source Separation with Weakly Labelled Data

arXiv 2023

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

arXiv 2023

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration

arXiv 2022

Neural Vocoder is All You Need for Speech Super-resolution

arXiv 2022

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

arXiv 2021

VoiceFixer: Toward General Speech Restoration with Neural Vocoder

arXiv 2021

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Haohe Liu

Mark D. Plumbley

Xubo Liu

Yuxuan Wang

Qiao Tian

Wenwu Wang

DeLiang Wang

Xingjian Du

Xu Tan

Yan Zhao