0

Modeling Event Plausibility with Consistent Conceptual Abstraction

Transformer models exhibit inconsistency in modeling event plausibility across conceptual classes, which can be improved with a post-hoc consistency method.

Year
2021
Venue
NAACL 2021 4
Authors
4
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2104.10247ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Understanding natural language requires common sense, one aspect of which is the ability to discern the plausibility of events. While distributional models -- most recently pre-trained, Transformer language models -- have demonstrated improvements in modeling event plausibility, their performance still falls short of humans'. In this work, we show that Transformer-based plausibility models are markedly inconsistent across the conceptual classes of a lexical hierarchy, inferring that "a person breathing" is plausible while "a dentist breathing" is not, for example. We find this inconsistency persists even when models are softly injected with lexical knowledge, and we present a simple post-hoc method of forcing model consistency that improves correlation with human plausibility judgements.

Authors

4