Can Rager

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability

arXiv 2025

2025

NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals

arXiv 2024

2024

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

arXiv 2024

2024

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

arXiv 2024

2024

Structured World Representations in Maze-Solving Transformers

arXiv 2023

2023

A Configurable Library for Generating and Manipulating Maze Datasets

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Samuel Marks

David Bau

Aaron Mueller

Adam Karvonen

Alex F. Spies

Cecilia Diniz Behn

Chris Mathwin

Dan Valentine

Guillaume Corlouer

Jannik Brinkmann