QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
arXiv 2024
Daehyun Ahn
HyungJun Kim
Jiwoong Choi
Jongho Lee
Minkyu Kim
Taesu Kim