Cite
Notes
Only stored in your browser.
Attribution
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
arXiv 2025
from 1 papers
Arvind Krishnamurthy
Baris Kasikci
Lequn Chen
Luis Ceze
Ruihang Lai
Stephanie Wang
Tianqi Chen
Wuwei Lin
Yineng Zhang
Zihao Ye