Collin Burns

AI alignment researcher; lead author of OpenAI's Weak-to-Strong Generalization; previously at Anthropic and now at the Center for AI Standards and Innovation.

Role: researcher
Currently at: Center for AI Standards and Innovation (CAISI)
Twitter: twitter.com/collinburns4
GitHub: github.com/collin-burns
Scholar: scholar.google.com/citations
Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

6papers

Authored papers

Measuring Mathematical Problem Solving With the MATH Dataset

NeurIPS

2021

Measuring Coding Challenge Competence With APPS

arXiv 2021

2021

CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review

arXiv 2021

2021

Measuring Massive Multitask Language Understanding

ICLR

2020

Aligning AI With Shared Human Values

arXiv 2020

2020

Interpreting Black Box Models via Hypothesis Testing

arXiv 2019

2019

Affiliations

Currently at

Center for AI Standards and Innovation (CAISI)

researcher · government

Previously

Anthropicfrontier lab OpenAIfrontier lab University of California, Berkeleyuniversity lab

Frequent co-authors

from 6 papers

Dan Hendrycks

director

5 shared papers

Dawn Song

professor

4 shared papers

Jacob Steinhardt

founder

4 shared papers

Steven Basart

researcher

4 shared papers

Akul Arora

researcher

2 shared papers

Mantas Mazeika

researcher

2 shared papers

Saurav Kadavath

researcher

2 shared papers

Andrew Critch

1 shared paper

Andy Zou

founder

1 shared paper

Anya Chen

1 shared paper