0

X-Stance: A Multilingual Multi-Target Dataset for Stance Detection

A large-scale stance detection dataset from multilingual election comments is used to train a single model for cross-lingual and cross-target stance detection, with moderate success demonstrated by multilingual BERT in a zero-shot setting.

Year
2020
Venue
arXiv 2020
Authors
2
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2003.08385v2ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

We extract a large-scale stance detection dataset from comments written by candidates of elections in Switzerland. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67 000 comments on more than 150 political issues (targets). Unlike stance detection models that have specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. "Do you support X?"). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach.

Authors

2