AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Active
Tests whether AI agents can perform real-world time-consuming tasks on the web.
- Publisher
- Tel Aviv University
- Domain
- Assistants
- License
- mit
- Published
- Oct 2024
- Notable for
- Benchmark for evaluating Assistants.
Cite
Notes
Only stored in your browser.
FAQ
- What is AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks??
- Tests whether AI agents can perform real-world time-consuming tasks on the web.
- What license is AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? under?
- AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? is available under mit.