Cite
Notes
Only stored in your browser.
Attribution
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition
arXiv 2024
from 1 papers
Lu Ye
Yang Li
Ze Tao