Cite
Notes
Only stored in your browser.
Attribution
MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control
arXiv 2024
Contrastive Preference Learning: Learning from Human Feedback without RL
arXiv 2023
Learning Optimal Advantage from Preferences and Mistaking it for Reward
from 3 papers
Scott Niekum
Anca Dragan
Chelsea Finn
Dongyoon Hahm
Dorsa Sadigh
Harshit Sikchi
Joey Hejna
June Suk Choi
Juyong Lee
Kimin Lee