Bartosz Cywiński
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation
arXiv 2026
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
arXiv 2025
Eliciting Secret Knowledge from Language Models
arXiv 2025
Towards eliciting latent knowledge from LLMs with mechanistic interpretability
arXiv 2025
GUIDE: Guidance-based Incremental Learning with Diffusion Models
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers