Attribution
Secrets of RLHF in Large Language Models Part II: Reward Modeling
arXiv 2024
Binghai Wang
Caishuang Huang
Enyu Zhou
Hang Yan
Jun Zhao
Lixing Shen
Lu Chen
Nuo Xu
Qi Zhang
Rui Zheng