Attribution
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents
arXiv 2026
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
arXiv 2025
from 2 papers
Abhishek Charnalia
Alisia Lupidi
Derek Dunfield
Despoina Magka
Edan Toledo
Jakob Foerster
Karen Hambardzumyan
Kelvin Niu
Lucia Cipolina-Kun
Martin Josifoski