AgentBench: Evaluate LLMs as Agents
Active
A benchmark designed to evaluate LLMs as Agents
- Publisher
- Tsinghua University
- Domain
- Coding
- License
- mit
- Published
- Aug 2025
- Notable for
- Benchmark for evaluating Coding.
Cite
Notes
Only stored in your browser.
FAQ
- What is AgentBench: Evaluate LLMs as Agents?
- A benchmark designed to evaluate LLMs as Agents
- What license is AgentBench: Evaluate LLMs as Agents under?
- AgentBench: Evaluate LLMs as Agents is available under mit.