We present the Latvian Twitter Eater Corpus - a set of tweets in the narrow domain related to food, drinks, eating and drinking. The corpus has been collected over time-span of over 8 years and includes over 2 million tweets entailed with additional useful data. We also separate two sub-corpora of question and answer tweets and sentiment annotated tweets. We analyse contents of the corpus and demonstrate use-cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models using data from the corpus.
What Can We Learn From Almost a Decade of Food Tweets
The Latvian Twitter Eater Corpus, consisting of over 2 million tweets on food and drinks, is analyzed and used to train domain-specific question-answering and sentiment-analysis models.
- Year
- 2020
- Venue
- arXiv 2020
- Authors
- 2
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2007.05194v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar