Cite
Notes
Only stored in your browser.
Attribution
Escalation Risks from Language Models in Military and Diplomatic Decision-Making
arXiv 2024
Welfare Diplomacy: Benchmarking Language Model Cooperation
arXiv 2023
SuperHF: Supervised Iterative Learning from Human Feedback
from 3 papers
Alan Chan
Anka Reuel
Chandler Smith
Gitta Kutyniok
Hannah Erlebach
Jacquelyn Schneider
Jesse Clifton
Juan-Pablo Rivera
Kush Bhatia
Lewis Hammond