Attribution
Early Weight Averaging meets High Learning Rates for LLM Pre-training
arXiv 2023
Abhishek Kumar
Jean Kaddour
Sujay Sanghavi
Sunny Sanyal