Cite
Notes
Only stored in your browser.
Attribution
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
arXiv 2023
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation
arXiv 2022
from 2 papers
Ganesh Jawahar
researcher
Muhammad Abdul-Mageed
Aasish Pappu
Ahmed Hassan Awadallah
Barlas Oğuz
Dilin Wang
Fei Sun
Haichuan Yang
Jianfeng Gao
Meng Li