Christopher Ré
- Papers
- 36
Cite
Notes
Only stored in your browser.
Authored papers
36HipKittens: Fast and Furious AMD Kernels
arXiv 2025
Cartridges: Lightweight and general-purpose long context representations via self-study
arXiv 2025
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI
arXiv 2025
ThunderKittens: Simple, Fast, and Adorable AI Kernels
arXiv 2024
Restructuring Vector Quantization with the Rotation Trick
arXiv 2024
Automating the Enterprise with Foundation Models
arXiv 2024
Smoothie: Label Free Language Model Routing
arXiv 2024
Simple linear attention language models balance the recall-throughput tradeoff
arXiv 2024
RedPajama: an Open Dataset for Training Large Language Models
arXiv 2024
LoLCATs: On Low-Rank Linearizing of Large Language Models
arXiv 2024
Archon: An Architecture Search Framework for Inference-Time Techniques
arXiv 2024
State-Free Inference of State-Space Models: The Transfer Function Approach
arXiv 2024
Just read twice: closing the recall gap for recurrent language models
arXiv 2024
Aioli: A Unified Optimization Framework for Language Model Data Mixing
arXiv 2024
Hydragen: High-Throughput LLM Inference with Shared Prefixes
arXiv 2024
Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs
arXiv 2024
H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
arXiv 2023
Hyena Hierarchy: Towards Larger Convolutional Language Models
arXiv 2023
Effectively Modeling Time Series with Simple Discrete State Spaces
arXiv 2023
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
arXiv 2023
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
legalbench-a-collaboratively-built-benchmark
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
arXiv 2023
TART: A plug-and-play Transformer module for task-agnostic reasoning
arXiv 2023
Context-Aware Meta-Learning
arXiv 2023
Transform Once: Efficient Operator Learning in Frequency Domain
arXiv 2022
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections
arXiv 2022
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
arXiv 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
arXiv 2022
Can Foundation Models Wrangle Your Data?
arXiv 2022
VORTEX: Physics-Driven Data Augmentations Using Consistency Training for Robust Accelerated MRI Reconstruction
vortex-physics-driven-data-augmentations-for
Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text
Findings (EMNLP) 2021 11
Robustness Gym: Unifying the NLP Evaluation Landscape
NAACL 2021 4
Efficiently Modeling Long Sequences with Structured State Spaces
efficiently-modeling-long-sequences-with
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
pixelated-butterfly-simple-and-efficient
Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
scatterbrain-unifying-sparse-and-low-rank-1
HiPPO: Recurrent Memory with Optimal Polynomial Projections
NeurIPS 2020 12
Affiliations
Frequent co-authors
10from 36 papers