Milad Aghajohari

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

The Markovian Thinker

arXiv 2025

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

arXiv 2025

Learning Robust Social Strategies with Large Language Models

arXiv 2025

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

arXiv 2024

No known affiliations.

from 4 papers

Aaron Courville

Amirhossein Kazemnejad

Siva Reddy

Alessandro Sordoni

Aditi Khandelwal

Arkil Patel

Austin Kraft

Benno Krojer

Dereck Piche

Dongchan Shin

researcher