Michael Noukhovitch
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Olmo 3
arXiv 2025
Learning Robust Social Strategies with Large Language Models
arXiv 2025
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
arXiv 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
arXiv 2024
Language Model Alignment with Elastic Reset
language-model-alignment-with-elastic-reset
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers
Aaron Courville
Arian Hosseini
Shengyi "Costa" Huang
researcher
Akari Asai
Akshita Bhagia
Alexander Wettig
researcher
Ali Farhadi
CEO
Alisa Liu
researcher
Allyson Ettinger
Aman Rangapur