Efficient Purely Convolutional Text Encoding

A lightweight convolutional architecture for sentence embeddings is proposed. It improves on recursive convolutional auto-encoding of byte-level text by reducing training time and parameter count while improving auto-encoding accuracy.

Year
2018
Venue
arXiv 2018
Authors
3
Hosting
Abstract only

Attribution

Abstract & full text
arxiv.org/abs/1808.01160
TL;DR
Semantic Scholar

Abstract

In this work, we focus on a lightweight convolutional architecture that creates fixed-size vector embeddings of sentences. Such representations are useful for building NLP systems, including conversational agents. Our work derives from a recently proposed recursive convolutional architecture for auto-encoding text paragraphs at the byte level. We propose alterations that significantly reduce training time and the number of parameters, and that improve auto-encoding accuracy. Finally, we evaluate the representations created by our model on tasks from the SentEval benchmark suite, and show that it can serve as a better, yet still fairly low-resource, alternative to popular bag-of-words embeddings.
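To make "fixed-size vector embeddings" from byte-level input concrete, here is a minimal sketch and not the authors' model. It assumes a single convolution layer with untrained random weights and swaps in global max-over-time pooling for the recursive architecture mentioned in the abstract. All names (`encode_bytes`, `make_filters`, `sentence_embedding`) and all sizes are hypothetical, chosen for illustration only.

```python
import random


def encode_bytes(text, max_len=64):
    """Turn text into UTF-8 byte values, truncated or zero-padded to max_len."""
    data = text.encode("utf-8")[:max_len]
    return list(data) + [0] * (max_len - len(data))


def make_filters(num_filters=8, width=3, embed_dim=4, seed=0):
    """Random (untrained) byte embeddings and 1D convolution filters."""
    rng = random.Random(seed)
    embed = [[rng.uniform(-1, 1) for _ in range(embed_dim)] for _ in range(256)]
    filters = [
        [[rng.uniform(-1, 1) for _ in range(embed_dim)] for _ in range(width)]
        for _ in range(num_filters)
    ]
    return embed, filters


def sentence_embedding(text, embed, filters, max_len=64):
    """Convolve over the byte sequence, apply ReLU, then max-pool over positions.

    The output length equals the number of filters, whatever the input length.
    """
    seq = [embed[b] for b in encode_bytes(text, max_len)]
    width = len(filters[0])
    dim = len(seq[0])
    out = []
    for f in filters:
        best = 0.0  # ReLU floor: activations below zero are clipped
        for i in range(len(seq) - width + 1):
            act = sum(f[k][d] * seq[i + k][d] for k in range(width) for d in range(dim))
            best = max(best, act)
        out.append(best)
    return out


embed, filters = make_filters()
short_vec = sentence_embedding("hi", embed, filters)
long_vec = sentence_embedding("a much longer sentence about text encoders", embed, filters)
print(len(short_vec), len(long_vec))  # both equal the number of filters
```

With trained weights, vectors like these could be fed to downstream classifiers in the same way bag-of-words embeddings are; the random weights here only demonstrate the shapes involved.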