Attribution
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts
arXiv 2025
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
arXiv 2024
from 2 papers
Haibin Lin
Qi Hou
Xin Liu
Chengquan Jiang
Cong Xie
Ding Zhou
Haohan Xu
Haoran Wei
Hongmin Chen
Jianxi Ye