Attribution
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
arXiv 2024
Baizhou Xu
Dongsheng Li
Libo Zhang
Zhaoning Zhang