Rishabh Agarwal

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

Process Reward Models That Think

arXiv 2025

2025

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

arXiv 2024

2024

Training Language Models to Self-Correct via Reinforcement Learning

arXiv 2024

2024

Bigger, Better, Faster: Human-level Atari with human-level efficiency

arXiv 2023

2023

Revisiting Bellman Errors for Offline Model Selection

arXiv 2023

2023

Deep Reinforcement Learning at the Edge of the Statistical Precipice

NeurIPS 2021 12

2021

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Aaron Courville

Max Schwarzer

Pablo Samuel Castro

Aleksandra Faust

Arian Hosseini

Avi Singh

Aviral Kumar

Colton Bishop

Cosmin Paduraru

Daniel de Marchi