Oleg Y. Rogov
4 papers
Authored papers
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders
arXiv 2025
Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models
arXiv 2025
CLEAR: Character Unlearning in Textual and Visual Modalities
arXiv 2024
Certification of Speaker Recognition Models to Additive Perturbations
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers