0

Christopher Ré

Papers
36

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
36papers

Authored papers

36

HipKittens: Fast and Furious AMD Kernels

arXiv 2025

2025

Cartridges: Lightweight and general-purpose long context representations via self-study

arXiv 2025

2025

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

arXiv 2025

2025

ThunderKittens: Simple, Fast, and Adorable AI Kernels

arXiv 2024

2024

Restructuring Vector Quantization with the Rotation Trick

arXiv 2024

2024

Automating the Enterprise with Foundation Models

arXiv 2024

2024

Smoothie: Label Free Language Model Routing

arXiv 2024

2024

Simple linear attention language models balance the recall-throughput tradeoff

arXiv 2024

2024

RedPajama: an Open Dataset for Training Large Language Models

arXiv 2024

2024

LoLCATs: On Low-Rank Linearizing of Large Language Models

arXiv 2024

2024

Archon: An Architecture Search Framework for Inference-Time Techniques

arXiv 2024

2024

State-Free Inference of State-Space Models: The Transfer Function Approach

arXiv 2024

2024

Just read twice: closing the recall gap for recurrent language models

arXiv 2024

2024

Aioli: A Unified Optimization Framework for Language Model Data Mixing

arXiv 2024

2024

Hydragen: High-Throughput LLM Inference with Shared Prefixes

arXiv 2024

2024

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs

arXiv 2024

2024

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

arXiv 2023

2023

Hyena Hierarchy: Towards Larger Convolutional Language Models

arXiv 2023

2023

Effectively Modeling Time Series with Simple Discrete State Spaces

arXiv 2023

2023

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

arXiv 2023

2023

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

legalbench-a-collaboratively-built-benchmark

2023

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

arXiv 2023

2023

TART: A plug-and-play Transformer module for task-agnostic reasoning

arXiv 2023

2023

Context-Aware Meta-Learning

arXiv 2023

2023

Transform Once: Efficient Operator Learning in Frequency Domain

arXiv 2022

2022

How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections

arXiv 2022

2022

Hungry Hungry Hippos: Towards Language Modeling with State Space Models

arXiv 2022

2022

Monarch: Expressive Structured Matrices for Efficient and Accurate Training

arXiv 2022

2022

Can Foundation Models Wrangle Your Data?

arXiv 2022

2022

VORTEX: Physics-Driven Data Augmentations Using Consistency Training for Robust Accelerated MRI Reconstruction

vortex-physics-driven-data-augmentations-for

2021

Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

Findings (EMNLP) 2021 11

2021

Robustness Gym: Unifying the NLP Evaluation Landscape

NAACL 2021 4

2021

Efficiently Modeling Long Sequences with Structured State Spaces

efficiently-modeling-long-sequences-with

2021

Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models

pixelated-butterfly-simple-and-efficient

2021

Scatterbrain: Unifying Sparse and Low-rank Attention Approximation

scatterbrain-unifying-sparse-and-low-rank-1

2021

HiPPO: Recurrent Memory with Optimal Polynomial Projections

NeurIPS 2020 12

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 36 papers