Attribution
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
arXiv 2024
Chao Zeng
Fangmin Chen
Hong Liu
Miao Wei
Shu Yang
Songwei Liu
Xing Mei
Yusheng Xie