0

AgentBench: Evaluate LLMs as Agents

Active

A benchmark designed to evaluate LLMs as Agents

Domain
Coding
License
mit
Published
Aug 2025
Notable for
Benchmark for evaluating Coding.

Cite

Notes

Only stored in your browser.

FAQ

What is AgentBench: Evaluate LLMs as Agents?
A benchmark designed to evaluate LLMs as Agents
What license is AgentBench: Evaluate LLMs as Agents under?
AgentBench: Evaluate LLMs as Agents is available under mit.