Attribution
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
arXiv 2024
Haoran Sun
Hua Wu
Shuohuan Wang
Yekun Chai
Yu Sun