Shijie Cao
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9MiMo-V2-Flash Technical Report
arXiv 2026
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
arXiv 2025
Data Efficacy for Language Model Training
arXiv 2025
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
arXiv 2025
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation
arXiv 2024
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge
arXiv 2024
AFPQ: Asymmetric Floating Point Quantization for LLMs
arXiv 2023
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference
arXiv 2023
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
dense-to-sparse-gate-for-mixture-of-experts
Affiliations
Frequent co-authors
10from 9 papers