Attribution
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models
arXiv 2025
FlatQuant: Flatness Matters for LLM Quantization
arXiv 2024
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
from 3 papers
Chun Yuan
Haoli Bai
Lu Hou
Jun Yao
Xianzhi Yu
Yuening Li
Yuxuan Sun
Han Bao
Han Gao
Haokun Lin