Size Zheng
4 papers
Authored papers
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts
arXiv 2025
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
arXiv 2024
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers