Nicholas Joseph

Founding team member and technical staff at Anthropic; previously at OpenAI on GPT-3.

Role: researcher
Currently at: Anthropic
Scholar: scholar.google.com/citations
Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

6papers

Authored papers

6

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

arXiv 2023

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

preprint

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

arXiv 2022

Discovering Language Model Behaviors with Model-Written Evaluations

arXiv 2022

Constitutional AI: Harmlessness from AI Feedback

arXiv 2022

Evaluating Large Language Models Trained on Code

preprint

Affiliations

Currently at

researcher · frontier lab

Previously

OpenAIfrontier lab

Frequent co-authors

10

from 6 papers

Jared Kaplan

co-founder / Chief Science Officer

6 shared papers

Sam McCandlish

founder

6 shared papers

Anna Chen

researcher

5 shared papers

Danny Hernandez

researcher

5 shared papers

Dario Amodei

CEO

5 shared papers

Jackson Kernion

researcher

5 shared papers

Sheer El Showk

researcher

5 shared papers

Zac Hatfield-Dodds

researcher

5 shared papers

Amanda Askell

researcher

4 shared papers

Andy Jones

researcher

4 shared papers