Cite
Notes
Only stored in your browser.
Attribution
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
arXiv 2025
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
arXiv 2024
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
dense-to-sparse-gate-for-mixture-of-experts
from 3 papers
Shijie Cao
Lei Wang
Mao Yang
Ting Cao
Bin Cui
Fan Yang
Hayden Kwok-Hay So
Jianyu Wei
Jilong Xue
Li Dong