This paper describes a pipeline for collecting acoustic scene data by using crowdsourcing. The detailed process of crowdsourcing is explained, including planning, validation criteria, and actual user interfaces. As a result of data collection, we present CochlScene, a novel dataset for acoustic scene classification. Our dataset consists of 76k samples collected from 831 participants in 13 acoustic scenes. We also propose a manual data split of training, validation, and test sets to increase the reliability of the evaluation results. Finally, we provide a baseline system for future research.
CochlScene: Acquisition of acoustic scene data using crowdsourcing
This paper describes a pipeline for collecting acoustic scene data by using crowdsourcing.
- Year
- 2022
- Venue
- arXiv 2022
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2211.02289ARXIV-DEFAULT
- TL;DR
- Semantic Scholar