0

Evaluating Semantic Accuracy of Data-to-Text Generation with Natural Language Inference

A new metric using a neural model for natural language inference evaluates semantic accuracy in data-to-text generation by checking textual entailment.

Year
2020
Venue
INLG (ACL) 2020 12
Authors
2
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2011.10819ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

A major challenge in evaluating data-to-text (D2T) generation is measuring the semantic accuracy of the generated text, i.e. checking if the output text contains all and only facts supported by the input data. We propose a new metric for evaluating the semantic accuracy of D2T generation based on a neural model pretrained for natural language inference (NLI). We use the NLI model to check textual entailment between the input data and the output text in both directions, allowing us to reveal omissions or hallucinations. Input data are converted to text for NLI using trivial templates. Our experiments on two recent D2T datasets show that our metric can achieve high accuracy in identifying erroneous system outputs.

Authors

2