Cite
Notes
Only stored in your browser.
Attribution
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
arXiv 2024
from 1 papers
Dawei Zhu
Jiebin Zhang
Lifeng Shang
Qun Liu
Sujian Li
Wenhao Wu
Xiaoguang Li
YiFan Song