Cite
Notes
Only stored in your browser.
Attribution
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
arXiv 2024
from 1 papers
Cong Xie
Ding Zhou
Haibin Lin
Haoran Wei
Hongmin Chen
Jianxi Ye
Leqi Zou
Liang Xiang
Pengfei Nie
Qi Hou