Cite
Notes
Only stored in your browser.
Attribution
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
arXiv 2025
from 1 papers
Jianqiao Lu
Xun Zhou
Xunhao Lai
Yiyuan Ma