Attribution
InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques
arXiv 2024
Adrià Garriga-Alonso
Iván Arcuschin
Thomas Kwa