Cite
Notes
Only stored in your browser.
Attribution
Llumnix: Dynamic Scheduling for Large Language Model Serving
arXiv 2024
from 1 papers
Hanyu Zhao
Wei Lin
Wencong Xiao
Xinyi Zhang
Yong Li
Ziming Huang