Cite
Notes
Only stored in your browser.
Attribution
Squeezed Attention: Accelerating Long Context Length LLM Inference
arXiv 2024
from 1 papers
Amir Gholami
Coleman Hooper
Hiva Mohammadzadeh
Kurt Keutzer
Michael W. Mahoney
Monishwaran Maheswaran
Sehoon Kim