Attribution
CHAI: Clustered Head Attention for Efficient LLM Inference
arXiv 2024
Few-shot Fine-tuning is All You Need for Source-free Domain Adaptation
arXiv 2023
from 2 papers
Basil Hosmer
Bilge Acun
Carole-Jean Wu
Dimitris Papailiopoulos
Jihyo Kim
Mostafa Elhoushi
Sangheum Hwang
Saurabh Agarwal
Seungwon Seo
Shivaram Venkataraman