Attribution
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
arXiv 2024
Avner May
Beidi Chen
Max Ryabinin
Ruslan Svirschevski
Zhihao Jia
Zhuoming Chen