Helin Wang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11HeartMuLa: A Family of Open Sourced Music Foundation Models
arXiv 2026
A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation
arXiv 2026
SAM Audio: Segment Anything in Audio
arXiv 2025
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline
arXiv 2025
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits
arXiv 2025
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech
arXiv 2025
Noise-robust Speech Separation with Fast Generative Correction
arXiv 2024
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer
arXiv 2024
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
arXiv 2024
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer
arXiv 2024
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers