Attribution
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
arXiv 2023
from 1 paper
Haibo Chen
Haotong Xie
Zeyu Mi