Alexey Dontsov

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

arXiv 2025

CLEAR: Character Unlearning in Textual and Visual Modalities

arXiv 2024

No known affiliations.

from 2 papers

Elena Tutubalina

Ivan Oseledets

Oleg Y. Rogov

Aibek Alanov

Alexey Zhavoronkin

Andrey Galichin

Anton Razzhigaev

Boris Mikheev

Denis Bobkov

Dmitrii Korzh