Wenwu Wang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11ComposerX: Multi-Agent Symbolic Music Composition with LLMs
arXiv 2024
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models
arXiv 2024
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
arXiv 2024
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
arXiv 2023
Separate Anything You Describe
arXiv 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
arXiv 2023
WavJourney: Compositional Audio Creation with Large Language Models
arXiv 2023
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research
arXiv 2023
Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
arXiv 2023
Sparks of Large Audio Models: A Survey and Outlook
arXiv 2023
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification
arXiv 2022
Affiliations
Frequent co-authors
10from 11 papers