Attribution
Confidential Prompting: Protecting User Prompts from Cloud LLM Providers
arXiv 2024
Prompt Cache: Modular Attention Reuse for Low-Latency Inference
arXiv 2023
from 2 papers
In Gim
Anurag Khandelwal
Caihua Li
Guojun Chen
Nikhil Sarda
Seung-seob Lee