Saleh Ashkboos
- Papers: 7
Authored papers (7)
Quartet: Native FP4 Training Can Be Optimal for Large Language Models
arXiv 2025
HALO: Hadamard-Assisted Lower-Precision Optimization for LLMs
arXiv 2025
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
arXiv 2024
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs
arXiv 2024
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
arXiv 2023
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models
arXiv 2023
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
arXiv 2022
Frequent co-authors
10 from 7 papers
Dan Alistarh
Torsten Hoefler
Elias Frantar
James Hensman
Mahdi Nikdan
Maximilian L. Croci
Roberto L. Castro
Soroush Tabesh
Alexander Borzunov
Amirkeivan Mohtashami