Anca Dragan

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

AssistanceZero: Scalably Solving Assistance Games

arXiv 2025

2025

Correlated Proxies: A New Definition and Improved Mitigation for Reward Hacking

arXiv 2024

2024

Learning to Assist Humans without Inferring Rewards

arXiv 2024

2024

The Effective Horizon Explains Deep RL Performance in Stochastic Environments

arXiv 2023

2023

Learning Optimal Advantage from Preferences and Mistaking it for Reward

arXiv 2023

2023

Inferring Rewards from Language in Context

ACL 2022 5

2022

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Cassidy Laidlaw

3 shared papers

Stuart Russell

professor

2 shared papers

Banghua Zhu

professor

1 shared paper

Benjamin Eysenbach

1 shared paper

Dan Klein

1 shared paper

Daniel Fried

professor

Dylan Feng

Eli Bronstein

Evan Ellis

Jessy Lin