Jan Betley

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

arXiv 2025

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs

arXiv 2025

Tell me about yourself: LLMs are aware of their learned behaviors

arXiv 2025

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

arXiv 2024

No known affiliations.

from 4 papers

Owain Evans

founder

Anna Sztyber-Betley

James Chua

Martín Soto

Xuchan Bao

Andy Arditi

Cem Anil

Dami Choi

Daniel Tan

Dylan Feng