Attribution
Compositional preference models for aligning LMs
arXiv 2023
Aligning Language Models with Preferences through f-divergence Minimization
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
from 3 papers
Jos Rozen
Dongyoung Go
Marc Dymetman
Tomasz Korbak
Aarohi Srivastava
researcher
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
Adam Fisch