Cite
Notes
Only stored in your browser.
Attribution
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
arXiv 2025
Logic.py: Bridging the Gap between LLMs and Constraint Solvers
ARE: Scaling Up Agent Environments and Evaluations
from 3 papers
Amar Budhiraja
Vladislav Vorotilov
Ajay Menon
Amine Benhalloum
Andrey Rusakov
Deepak Nathani
Despoina Magka
Dheeraj Mekala
Dieuwke Hupkes
Emilien Garreau