Cite
Notes
Only stored in your browser.
Attribution
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
arXiv 2023
from 1 papers
Ruoyu Sun
Yang Yu
Yushun Zhang
Zhi-Quan Luo
Zhihang Lin
Ziniu Li