Cite
Notes
Only stored in your browser.
Attribution
Rethinking Optimization and Architecture for Tiny Language Models
arXiv 2024
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting
EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models
from 3 papers
Kai Han
Yehui Tang
Yunhe Wang
Fangcheng Liu
Chuanjian Liu
Shangling Jui
Sichao Liu
Yi-Qi Hu
Yuchuan Tian
Zhenhua Liu