Cite
Notes
Only stored in your browser.
Attribution
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
arXiv 2024
from 1 papers
Cong Xie
Ding Zhou
Haibin Lin
Haohan Xu
Haoran Wei
Jianxi Ye
Leqi Zou
Liang Xiang
Pengfei Nie
Qi Hou