Vage Egiazarian
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6AutoJudge: Judge Decoding Without Manual Annotation
arXiv 2025
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
arXiv 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
arXiv 2025
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models
arXiv 2025
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
arXiv 2023
Neural Optimal Transport with General Cost Functionals
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers