Alex Tamkin
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models
arXiv 2024
Eliciting Human Preferences with Language Models
arXiv 2023
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
arXiv 2023
C5T5: Controllable Generation of Organic Molecules with Transformers
c5t5-controllable-generation-of-organic-1
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers
Belinda Z. Li
Buck Shlegeris
Carson Denison
Daniel Rothchild
David Duvenaud
Ethan Perez
Evan Hubinger
Fazl Barez
Jacob Andreas
Jared Kaplan
co-founder / Chief Science Officer