Cite
Notes
Only stored in your browser.
Attribution
Llumnix: Dynamic Scheduling for Large Language Model Serving
arXiv 2024
from 1 papers
Biao Sun
Hanyu Zhao
Wei Lin
Xinyi Zhang
Yong Li
Ziming Huang