Xupeng Miao
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
arXiv 2024
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
arXiv 2024
Generative Dense Retrieval: Memory Can Be a Burden
arXiv 2024
Experimental Analysis of Large-scale Learnable Vector Storage Compression
arXiv 2023
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
dense-to-sparse-gate-for-mixture-of-experts
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers