Cite
Notes
Only stored in your browser.
Attribution
More for Keys, Less for Values: Adaptive KV Cache Quantization
arXiv 2025
from 1 papers
Lam Nguyen
Mohsen Hariri
Qifan Wang
Shaochen Zhong
Vipin Chaudhary
Xia Hu
Xiaotian Han