0

Dear Sir or Madam, May I introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer

A large corpus for formality style transfer is created, demonstrating machine translation techniques as strong baselines, while discussing challenges with automatic evaluation metrics.

Year
2018
Venue
dear-sir-or-madam-may-i-introduce-the-gyafc-1
Authors
2
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/1803.06535v2ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Style transfer is the task of automatically transforming a piece of text in one particular style into another. A major barrier to progress in this field has been a lack of training and evaluation datasets, as well as benchmarks and automatic metrics. In this work, we create the largest corpus for a particular stylistic transfer (formality) and show that techniques from the machine translation community can serve as strong baselines for future work. We also discuss challenges of using automatic metrics.

Authors

2