Ruslan Svirschevski
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4AutoJudge: Judge Decoding Without Manual Annotation
arXiv 2025
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
arXiv 2024
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
arXiv 2024
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers