Attribution
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
arXiv 2025
When Can You Get Away with Low Memory Adam?
from 2 papers
John Kirchenbauer
Tom Goldstein
Ang Li
Avi Schwarzschild
Bhavya Kailkhura
Brian R. Bartoldson
Jonas Geiping
Maissam Barkeshli
Micah Goldblum
Sean McLeish