Cite
Notes
Only stored in your browser.
Attribution
DFlash: Block Diffusion for Flash Speculative Decoding
arXiv 2026
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
arXiv 2025
from 2 papers
Zhijian Liu
Haisheng Chen
Jian Chen
Song Han