In this work, we focus on a lightweight convolutional architecture that creates fixed-size vector embeddings of sentences. Such representations are useful for building NLP systems, including conversational agents. Our work derives from a recently proposed recursive convolutional architecture for auto-encoding text paragraphs at byte level. We propose alternations that significantly reduce training time, the number of parameters, and improve auto-encoding accuracy. Finally, we evaluate the representations created by our model on tasks from SentEval benchmark suite, and show that it can serve as a better, yet fairly low-resource alternative to popular bag-of-words embeddings.
Efficient Purely Convolutional Text Encoding
A lightweight convolutional architecture for sentence embeddings is proposed, improving recursive convolutional auto-encoding for byte-level text with reduced training time, parameters, and improved accuracy.
- Year
- 2018
- Venue
- arXiv 2018
- Authors
- 3
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1808.01160ARXIV-DEFAULT
- TL;DR
- Semantic Scholar