Cite
Notes
Only stored in your browser.
Attribution
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization
arXiv 2024
from 1 papers
Fartash Faghri
Iman Mirzadeh
Keivan Alizadeh Vahid
Mehrdad Farajtabar
Minsik Cho
Mohammad Samragh
Moin Nabi