Cite
Notes
Only stored in your browser.
Attribution
Optimizing Large Language Model Training Using FP4 Quantization
arXiv 2025
Sigma-Moe-Tiny Technical Report
SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
Tutel: Adaptive Mixture-of-Experts at Scale
arXiv 2022
from 4 papers
Peng Cheng
Ruizhe Wang
Xiao Liu
Yeyun Gong
Yifan Xiong
Lei Qu
Rui Gao
Tianyu Chen
Yucheng Ding
Yuting Jiang