Cite
Notes
Only stored in your browser.
Attribution
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
arXiv 2025
Controllable Context Sensitivity and the Knob Behind It
arXiv 2024
from 2 papers
Adam Karvonen
Arnab Sen Sharma
Chris Wendler
Clément Dumas
Daniel Wen
Euan Ong
Giovanni Monea
James Chua
Kevin Du
Kit Fraser-Taliente