Cite
Notes
Only stored in your browser.
Attribution
LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding
arXiv 2026
from 1 papers
Baotian Hu
Dongfang Li
Gang Lin
Min Zhang
Xuhui Chen
Yukun Shi