Attribution
Group Robust Preference Optimization in Reward-free RLHF
arXiv 2024
Haitham Bou-Ammar
Ilija Bogunovic
Pier Giuseppe Sessa
Shyam Sundhar Ramesh
Viraj Mehta
Yifan Hu