Attribution
Cartridges: Lightweight and general-purpose long context representations via self-study
arXiv 2025
Hydragen: High-Throughput LLM Inference with Shared Prefixes
arXiv 2024
from 2 papers
Azalia Mirhoseini
Christopher Ré
Atri Rudra
Bradley Brown
Daniel Y. Fu
Dylan Zinsley
Emily Liu
James Zou
Jordan Juravsky
Neel Guha