0

MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification

MAFALDA is a benchmark for fallacy classification that combines existing datasets, introduces a new taxonomy and annotation scheme, and evaluates language models and human performance in detecting and classifying fallacies under zero-shot conditions.

Year
2023
Venue
arXiv 2023
Authors
5
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2311.09761v2ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

We introduce MAFALDA, a benchmark for fallacy classification that merges and unites previous fallacy datasets. It comes with a taxonomy that aligns, refines, and unifies existing classifications of fallacies. We further provide a manual annotation of a part of the dataset together with manual explanations for each annotation. We propose a new annotation scheme tailored for subjective NLP tasks, and a new evaluation method designed to handle subjectivity. We then evaluate several language models under a zero-shot learning setting and human performances on MAFALDA to assess their capability to detect and classify fallacies.

Authors

5