Eric J. Michaud

Papers: 5

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

5papers

Authored papers

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

arXiv 2024

2024

The Geometry of Concepts: Sparse Autoencoder Feature Structure

arXiv 2024

2024

Opening the AI black box: program synthesis via mechanistic interpretability

arXiv 2024

2024

Not All Language Model Features Are Linear

arXiv 2024

2024

Towards Understanding Grokking: An Effective Theory of Representation Learning

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 5 papers

Max Tegmark

professor

Isaac Liao

Joshua Engels

Ziming Liu

Aaron Mueller

Anish Mudide

Can Rager

Chloe Loughridge

David Bau

David D. Baek