Cite
Notes
Only stored in your browser.
Attribution
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
arXiv 2024
AgentBench: Evaluating LLMs as Agents
arXiv 2023
from 2 papers
Aohan Zeng
Ashish Sabharwal
Ben Bogin
Chenhui Zhang
Erin Bransom
Hanchen Zhang
Hangliang Ding
Hanyu Lai
Hao Yu
Huan Sun