Saurav Kadavath

Researcher at Anthropic working on finetuning and alignment; co-author of Constitutional AI and "Language Models (Mostly) Know What They Know".

Role: researcher
Currently at: Anthropic
Papers: 8
