Attribution
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals
arXiv 2024
from 1 paper
Kaushik Roy
Utkarsh Saxena
Xin Wang