Improve Transformer Models with Better Relative Position Embeddings
Transformer architectures rely on explicit position encodings in order to preserve a notion of word order. In this paper, we argue that existing work does not fully utilize position information.
- Year
- 2020
- Venue
- Findings of the Association for Computational Linguistics 2020
- Authors
- 4
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.