Attribution
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
arXiv 2024
from 1 paper
April Yang
Colin Unger
Gabriele Oliaro
Mengdi Wu
Remi Delacourt
Ruohan Gao
Vineeth Kada
Xinhao Cheng
Xupeng Miao
Yingcheng Wang