Rafael Rafailov

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

arXiv 2025

2025

Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels

arXiv 2024

2024

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

arXiv 2024

2024

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

arXiv 2024

2024

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

arXiv 2024

2024

Contrastive Preference Learning: Learning from Human Feedback without RL

arXiv 2023

2023

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

arXiv 2023

2023

Contrastive Example-Based Control

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Chelsea Finn

Anikait Singh

Chenhang Cui

Dorsa Sadigh

Huaxiu Yao

Kanishk Gandhi

Sergey Levine

professor

Tianhe Yu

Yiyang Zhou

Abhishek Padalkar