Tri Dao
Princeton CS professor and Chief Scientist at Together AI; author of FlashAttention and the Mamba state-space model.
- Role
- professor / Chief Scientist
- Currently at
- Princeton University
- twitter.com/tri_dao
- GitHub
- github.com/tridao
- Scholar
- scholar.google.com/citations
- Papers
- 23
Cite
Notes
Only stored in your browser.
Authored papers
23Introspective Diffusion Language Models
arXiv 2026
Log-Linear Attention
arXiv 2025
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
arXiv 2025
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
arXiv 2025
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
arXiv 2024
An Empirical Study of Mamba-based Language Models
arXiv 2024
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
arXiv 2024
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling
arXiv 2024
Marconi: Prefix Caching for the Era of Hybrid LLMs
arXiv 2024
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers
arXiv 2024
RedPajama: an Open Dataset for Training Large Language Models
arXiv 2024
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality
arXiv 2024
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
arXiv 2024
BitDelta: Your Fine-Tune May Only Be Worth One Bit
arXiv 2024
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
arXiv 2023
Hyena Hierarchy: Towards Larger Convolutional Language Models
arXiv 2023
Effectively Modeling Time Series with Simple Discrete State Spaces
arXiv 2023
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
arXiv 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
arXiv 2022
Transform Once: Efficient Operator Learning in Frequency Domain
arXiv 2022
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
pixelated-butterfly-simple-and-efficient
Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
scatterbrain-unifying-sparse-and-low-rank-1
HiPPO: Recurrent Memory with Optimal Polynomial Projections
NeurIPS 2020 12
Affiliations
Frequent co-authors
10from 23 papers