Cross-lingual speech emotion recognition is an important task for practical applications. The performance of automatic speech emotion recognition systems degrades in cross-corpus scenarios, particularly in scenarios involving multiple languages or a previously unseen language such as Urdu for which limited or no data is available. In this study, we investigate the problem of cross-lingual emotion recognition for Urdu language and contribute URDU---the first ever spontaneous Urdu-language speech emotion database. Evaluations are performed using three different Western languages against Urdu and experimental results on different possible scenarios suggest various interesting aspects for designing more adaptive emotion recognition system for such limited languages. In results, selecting training instances of multiple languages can deliver comparable results to baseline and augmentation a fraction of testing language data while training can help to boost accuracy for speech emotion recognition. URDU data is publicly available for further research.
Cross Lingual Speech Emotion Recognition: Urdu vs. Western Languages
Evaluations of cross-lingual speech emotion recognition systems using URDU, the first spontaneous Urdu-language speech emotion database, indicate that training with multiple languages and augmenting test language data can improve performance in resource-limited settings.
- Year
- 2018
- Venue
- arXiv 2018
- Authors
- 4
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1812.10411v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar