Attribution
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models
arXiv 2023
from 1 paper
Dan Alistarh
Elias Frantar
Ilia Markov
Jie Ren
Saleh Ashkboos
Tingxuan Zhong
Torsten Hoefler