Attribution
GameDevBench: Evaluating Agentic Capabilities Through Game Development
arXiv 2026
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
arXiv 2024
The Impact of Element Ordering on LM Agent Performance
from 3 papers
Ameet Talwalkar
Chris Donahue
Alexander Wang
Amaad Martin
Arnav Yayavaram
Boxuan Li
Frank F. Xu
Graham Neubig
professor
Hao Yang Lu
Kritanjali Jain