Cite
Notes
Only stored in your browser.
Attribution
CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference
arXiv 2025
from 1 papers
Bei Yu
Hui-Ling Zhen
Mingxuan Yuan
Sinno Jialin Pan
Wulong Liu
Xianzhi Yu
Zehua Pei