Jan Betley
4 papers
Authored papers
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs
arXiv 2025
Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs
arXiv 2025
Tell me about yourself: LLMs are aware of their learned behaviors
arXiv 2025
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers