Cite
Notes
Only stored in your browser.
Attribution
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
arXiv 2025
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
arXiv 2024
from 2 papers
Changsheng Li
Guoren Wang
Jianqiao Lu
Kaituo Feng
Xi Chen
Xiangyu Yue
Xun Zhou
Yao Luo
Ye Yuan
Yiyuan Ma