Cite
Notes
Only stored in your browser.
Attribution
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
large-batch-optimization-for-deep-learning
from 1 papers
Cho-Jui Hsieh
James Demmel
Jing Li
Jonathan Hseu
Kurt Keutzer
Sanjiv Kumar
Srinadh Bhojanapalli
Xiaodan Song
Yang You