Rafael Rafailov
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
arXiv 2025
Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
arXiv 2024
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
arXiv 2024
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels
arXiv 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
arXiv 2024
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
arXiv 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
arXiv 2023
Contrastive Example-Based Control
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers