Cite
Notes
Only stored in your browser.
Attribution
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale
arXiv 2023
from 1 papers
Dheevatsa Mudigere
Jose Gallego-Posada
Kaushik Rangadurai
Michael Rabbat
Shintaro Iwasaki
Tsung-Hsien Lee
Zhijing Li