0

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Active

Tests whether AI agents can perform real-world time-consuming tasks on the web.

Domain
Assistants
License
mit
Published
Oct 2024
Notable for
Benchmark for evaluating Assistants.

Cite

Notes

Only stored in your browser.

FAQ

What is AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks??
Tests whether AI agents can perform real-world time-consuming tasks on the web.
What license is AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? under?
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? is available under mit.