Cite
Notes
Only stored in your browser.
Attribution
Accelerating Production LLMs with Combined Token/Embedding Speculators
arXiv 2024
from 1 papers
Joshua Rosenkranz
Mudhakar Srivatsa
Pavithra Ranganathan
Raghu Ganti
Sahil Suneja
Thomas Parnell