Polina Druzhinina

Cite

Notes

Only stored in your browser.

Attribution

1papers

Authored papers

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

arXiv 2025

No known affiliations.

from 1 papers

Alexey Dontsov

Andrey Galichin

Anton Razzhigaev

Elena Tutubalina

Ivan Oseledets

Oleg Y. Rogov