Shijie Cao

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

MiMo-V2-Flash Technical Report

arXiv 2026

2026

Data Efficacy for Language Model Training

arXiv 2025

2025

Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

arXiv 2025

2025

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

arXiv 2025

2025

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

arXiv 2024

2024

T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge

arXiv 2024

2024

AFPQ: Asymmetric Floating Point Quantization for LLMs

arXiv 2023

2023

Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference

arXiv 2023

2023

EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate

dense-to-sparse-gate-for-mixture-of-experts

2021

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Ting Cao

Jianyu Wei

Lingxiao Ma

Mao Yang

Dayou Du

Lei Wang

Li Dong

Ningyi Xu

Xin Zhang

Yijia Zhang