Attribution
Fast On-device LLM Inference with NPUs
arXiv 2024
Daliang Xu
Gang Huang
Hao Zhang
professor
Liming Yang
Mengwei Xu
Xuanzhe Liu