Cite
Notes
Only stored in your browser.
Attribution
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
arXiv 2023
from 1 papers
Chao Wu
Lei Lu
Ming Lin
Peiyan Dong
Xuan Shen
Yanzhi Wang
Zhenglun Kong