CoMoSVC: Consistency Model-based Singing Voice Conversion

Diffusion-based Singing Voice Conversion (SVC) methods have achieved remarkable performance, producing natural audio with high similarity to the target timbre. However, their iterative sampling process makes inference slow, so acceleration becomes crucial.

Year: 2024
Hosting: External source, license unknown

Abstract & full text: arxiv.org/abs/2401.01792v1