Haihao Shen

Cite

Notes

Only stored in your browser.

Attribution

3papers

Authored papers

SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs

arXiv 2025

Effective Quantization for Diffusion Models on CPUs

arXiv 2023

Prune Once for All: Sparse Pre-Trained Language Models

arXiv 2021

No known affiliations.

from 3 papers

Heng Guo

Weiwei Zhang

Wenhua Cheng

Ariel Larey

Guy Boudoukh

Hanwen Chang

Kaokao Lv

Moshe Wasserblat

Ofir Zafrir

Xinyu Ye