Cite
Notes
Only stored in your browser.
Attribution
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
arXiv 2025
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
arXiv 2024
from 2 papers
Nick Haber
Agam Bhatia
Alon Albalak
Anikait Singh
Chase Blagden
Dakota Mahan
Daniel LK Yamins
Duy Phung
Kanishk Gandhi
Logan Cross