CoMoSVC: Consistency Model-based Singing Voice Conversion

The diffusion-based Singing Voice Conversion (SVC) methods have achieved remarkable performances, producing natural audios with high similarity to the target timbre. However, the iterative sampling process results in slow inference speed, and acceleration thus becomes crucial.

Open

Year: 2024
ArXiv: arxiv.org/abs/2401.01792
URL: arxiv.org/abs/2401.01792v1
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2401.01792v1
TL;DR: Semantic Scholar

Attribution policy →