Attribution
Falcon: Faster and Parallel Inference of Large Language Models through Enhanced Semi-Autoregressive Drafting and Custom-Designed Decoding Tree
arXiv 2024
Feng Ji
Weisheng Xie
Xiangxiang Gao