0

It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations

Perturbing inflectional morphology in training data improves NLP models' robustness to biases without affecting clean data performance.

Year
2020
Venue
it-s-morphin-time-combating-linguistic-1
Authors
4
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2005.04364ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Training on only perfect Standard English corpora predisposes pre-trained neural networks to discriminate against minorities from non-standard linguistic backgrounds (e.g., African American Vernacular English, Colloquial Singapore English, etc.). We perturb the inflectional morphology of words to craft plausible and semantically similar adversarial examples that expose these biases in popular NLP models, e.g., BERT and Transformer, and show that adversarially fine-tuning them for a single epoch significantly improves robustness without sacrificing performance on clean data.

Authors

4