Cite
Notes
Only stored in your browser.
Attribution
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
arXiv 2024
PyTorch Distributed: Experiences on Accelerating Data Parallel Training
arXiv 2020
from 2 papers
Alek Andreev
Aleksandar Botev
Andy Brock
Antonia Paterson
Anushan Fernando
Armand Joulin
Arnaud Doucet
Arthur Zucker
Brian Vaughan
Cassidy Hardin