Attribution
Accelerating Production LLMs with Combined Token/Embedding Speculators
arXiv 2024
Davis Wertheimer
Joshua Rosenkranz
Mudhakar Srivatsa
Pavithra Ranganathan
Raghu Ganti
Thomas Parnell