Attribution
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
XLNet: Generalized Autoregressive Pretraining for Language Understanding
from 2 papers
Quoc V. Le
Ruslan Salakhutdinov
professor
Yiming Yang
Zhilin Yang
Zihang Dai