Denis Kuznedelev
Papers: 10
Authored papers
Scale-wise Distillation of Diffusion Models
arXiv 2025
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models
arXiv 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
arXiv 2025
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
arXiv 2024
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression
arXiv 2024
EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search
arXiv 2024
Does Diffusion Beat GAN in Image Super Resolution?
arXiv 2024
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
arXiv 2023
A critical look at the evaluation of GNNs under heterophily: Are we really making progress?
arXiv 2023
Sparse Fine-tuning for Inference Acceleration of Large Language Models
arXiv 2023