Cite
Notes
Only stored in your browser.
Attribution
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference
arXiv 2024
from 1 papers
Dongsheng Li
Libo Zhang
Songzhu Mei
Zhaoning Zhang