Attribution
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression
arXiv 2025
Gaperon: A Peppered English-French Generative Language Model Suite
Authors (from 2 papers)
Benoît Sagot
Éric de la Clergerie
Alessio Devoto
Djamé Seddah
Pasquale Minervini
Rachel Bawden
Rian Touchent
Simone Scardapane
Wissam Antoun
Yu Zhao