Cite
Notes
Only stored in your browser.
Attribution
Simple linear attention language models balance the recall-throughput tradeoff
arXiv 2024
Just read twice: closing the recall gap for recurrent language models
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections
arXiv 2022
from 3 papers
Atri Rudra
Christopher Ré
Sabri Eyuboglu
Simran Arora
Aaryan Singhal
Albert Gu
Ashish Rao
Benjamin Spector
Dylan Zinsley
Isys Johnson