Cite
Notes
Only stored in your browser.
Attribution
REBEL: Reinforcement Learning via Regressing Relative Rewards
arXiv 2024
Inverse Reinforcement Learning without Reinforcement Learning
arXiv 2023
The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms
from 3 papers
Gokul Swamy
Sanjiban Choudhury
Aarti Singh
Anirudh Vemula
Jason D. Lee
Jonathan D. Chang
Kianté Brantley
Owen Oertell
Thorsten Joachims
Wen Sun