W. Bradley Knox

Cite

Notes

Only stored in your browser.

Attribution

3papers

Authored papers

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control

arXiv 2024

Learning Optimal Advantage from Preferences and Mistaking it for Reward

arXiv 2023

Contrastive Preference Learning: Learning from Human Feedback without RL

arXiv 2023

No known affiliations.

from 3 papers

Scott Niekum

Anca Dragan

Chelsea Finn

Dongyoon Hahm

Dorsa Sadigh

Harshit Sikchi

Joey Hejna

June Suk Choi

Juyong Lee

Kimin Lee