Attribution
Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
arXiv 2024
LQER: Low-Rank Quantization Error Reconstruction for LLMs
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
arXiv 2023
Authors (from 3 papers)
Cheng Zhang
Yiren Zhao
George A. Constantinides
Aaron Thomas
Ilia Shumailov
Xinxin Liu
Xitong Gao