Existing Machine Learning techniques yield close to human performance on text-based classification tasks. However, the presence of multi-modal noise in chat data such as emoticons, slang, spelling mistakes, code-mixed data, etc. makes existing deep-learning solutions perform poorly. The inability of deep-learning systems to robustly capture these covariates puts a cap on their performance. We propose NELEC: Neural and Lexical Combiner, a system which elegantly combines textual and deep-learning based methods for sentiment classification. We evaluate our system as part of the third task of 'Contextual Emotion Detection in Text' as part of SemEval-2019. Our system performs significantly better than the baseline, as well as our deep-learning model benchmarks. It achieved a micro-averaged F1 score of 0.7765, ranking 3rd on the test-set leader-board. Our code is available at https://github.com/iamgroot42/nelec
NELEC at SemEval-2019 Task 3: Think Twice Before Going Deep
A system named NELEC combines neural and lexical methods to improve sentiment classification in noisy chat data, achieving better performance than deep learning benchmarks on SemEval-2019's Contextual Emotion Detection in Text task.
- Year
- 2019
- Venue
- nelec-at-semeval-2019-task-3-think-twice-1
- Authors
- 2
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1904.03223ARXIV-DEFAULT
- TL;DR
- Semantic Scholar