0

Neural Academic Paper Generation

Recent language modeling techniques, particularly Transformers and Transformer-XL, are applied to generate $\LaTeX{}$ code for academic papers using a prepared dataset of open-source computer vision papers.

Year
2019
Venue
arXiv 2019
Authors
3
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/1912.01982ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

In this work, we tackle the problem of structured text generation, specifically academic paper generation in $\LaTeX{}$, inspired by the surprisingly good results of basic character-level language models. Our motivation is using more recent and advanced methods of language modeling on a more complex dataset of $\LaTeX{}$ source files to generate realistic academic papers. Our first contribution is preparing a dataset with $\LaTeX{}$ source files on recent open-source computer vision papers. Our second contribution is experimenting with recent methods of language modeling and text generation such as Transformer and Transformer-XL to generate consistent $\LaTeX{}$ code. We report cross-entropy and bits-per-character (BPC) results of the trained models, and we also discuss interesting points on some examples of the generated $\LaTeX{}$ code.

Authors

3