0

CoCon: A Self-Supervised Approach for Controlled Text Generation

A self-supervised method, Content-Conditioner (CoCon), is proposed to control a language model's output at the word- and phrase-level, incorporating target content and high-level text attributes effectively.

Year
2020
Venue
ICLR 2021 1
Authors
5
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2006.03535v3ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Pretrained Transformer-based language models (LMs) display remarkable natural language generation capabilities. With their immense potential, controlling text generation of such LMs is getting attention. While there are studies that seek to control high-level attributes (such as sentiment and topic) of generated text, there is still a lack of more precise control over its content at the word- and phrase-level. Here, we propose Content-Conditioner (CoCon) to control an LM's output text with a content input, at a fine-grained level. In our self-supervised approach, the CoCon block learns to help the LM complete a partially-observed text sequence by conditioning with content inputs that are withheld from the LM. Through experiments, we show that CoCon can naturally incorporate target content into generated texts and control high-level text attributes in a zero-shot manner.

Authors

5