Cite
Notes
Only stored in your browser.
Attribution
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
arXiv 2025
Effective Quantization for Diffusion Models on CPUs
arXiv 2023
Prune Once for All: Sparse Pre-Trained Language Models
arXiv 2021
from 3 papers
Heng Guo
Weiwei Zhang
Wenhua Cheng
Ariel Larey
Guy Boudoukh
Hanwen Chang
Kaokao Lv
Moshe Wasserblat
Ofir Zafrir
Xinyu Ye