Attribution
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
arXiv 2025
SageAttention2++: A More Efficient Implementation of SageAttention2
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference
from 3 papers
Haofeng Huang
Jianfei Chen
Jintao Zhang
Jun Zhu
Chendong Xiang
Pengle Zhang
Xiaoming Xu
Haocheng Xi
Haoxu Wang
Kai Jiang