Cite
Notes
Only stored in your browser.
Attribution
Fast On-device LLM Inference with NPUs
arXiv 2024
A Survey of Resource-efficient LLM and Multimodal Foundation Models
from 2 papers
Mengwei Xu
Xuanzhe Liu
Bingyang Wu
Chen Yang
Dongqi Cai
Gang Huang
Hao Zhang
professor
Li Zhang
Liming Yang
QiPeng Wang