Oleg Y. Rogov

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

arXiv 2025

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

arXiv 2025

CLEAR: Character Unlearning in Textual and Visual Modalities

arXiv 2024

Certification of Speaker Recognition Models to Additive Perturbations

arXiv 2024

No known affiliations.

from 4 papers

Ivan Oseledets

Dmitrii Korzh

Elena Tutubalina

Alexey Dontsov

Elvir Karimov

Aibek Alanov

Alexander Panchenko

Alexey Zhavoronkin

Andrey Galichin

Anton Razzhigaev