Ideology is at the core of political science research. Yet, there still does not exist general-purpose tools to characterize and predict ideology across different genres of text. To this end, we study Pretrained Language Models using novel ideology-driven pretraining objectives that rely on the comparison of articles on the same story written by media of different ideologies. We further collect a large-scale dataset, consisting of more than 3.6M political news articles, for pretraining. Our model POLITICS outperforms strong baselines and the previous state-of-the-art models on ideology prediction and stance detection tasks. Further analyses show that POLITICS is especially good at understanding long or formally written texts, and is also robust in few-shot learning scenarios.
POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection
A study on Pretrained Language Models with ideology-driven objectives improves ideology prediction and stance detection, demonstrating robust performance in understanding long texts and few-shot learning.
- Year
- 2022
- Venue
- Findings (NAACL) 2022 7
- Authors
- 5
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2205.00619ARXIV-DEFAULT
- TL;DR
- Semantic Scholar