Cite
Notes
Only stored in your browser.
Attribution
Sparse Fine-tuning for Inference Acceleration of Large Language Models
arXiv 2023
The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
arXiv 2022
from 2 papers
Dan Alistarh
Eldar Kurtic
Elias Frantar
Benjamin Fineran
Daniel Campos
Denis Kuznedelev
Mark Kurtz
Tuan Nguyen