Adrià Garriga-Alonso
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques
arXiv 2024
Towards Automated Circuit Discovery for Mechanistic Interpretability
towards-automated-circuit-discovery-for
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers