Cite
Notes
Only stored in your browser.
Attribution
EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models
arXiv 2023
from 1 papers
Ao Zhou
Liwei Guo
Mengwei Xu
Rongjie Yi
Shangguang Wang