Cite
Notes
Only stored in your browser.
Attribution
AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies
arXiv 2024
Llumnix: Dynamic Scheduling for Large Language Model Serving
from 2 papers
Biao Sun
Bo-Wen Zhang
ChengWei Wu
Dong Liang
Guang Liu
Jian Yang
Jijie Li
Li Du
Liangdong Wang
Mengdi Zhao