Attribution
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge
arXiv 2024
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
dense-to-sparse-gate-for-mixture-of-experts
from 2 papers
Bin Cui
Bin Xiao
Chunan Shi
Fan Yang
Jilong Xue
Lei Su
Lingxiao Ma
Qibin Liu
Shijie Cao
WeiPeng Chen