Attribution
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
arXiv 2025
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
arXiv 2024
Faster Causal Attention Over Large Sequences Through Sparse Flash Attention
arXiv 2023
from 3 papers
Alexander M. Rush
Junxiong Wang
Tri Dao
professor / Chief Scientist
Avner May
Daniel Ritter
François Fleuret
Martin Jaggi
Matteo Pagliardini
Wen-Ding Li
grad-student