Attribution
CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
arXiv 2024
Evaluating Quantized Large Language Models
from 2 papers
Guohao Dai
Shengen Yan
Shiyao Li
Xuefei Ning
Yu Wang
Huazhong Yang
Tengxuan Liu
Xiangsheng Shi
Zhihang Yuan