Mind2Web: Towards a Generalist Agent for the Web

Active

A dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website.

Domain: Assistants
License: MIT
Published: Mar 2025
Notable for: Benchmark for evaluating Assistants.

Related tools

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Mind2Web: Towards a Generalist Agent for the Web?
A dataset for developing and evaluating generalist agents for the web that can follow language instructions to complete complex tasks on any website.
How can a model improve its Mind2Web: Towards a Generalist Agent for the Web score?
Tools linked to Mind2Web: Towards a Generalist Agent for the Web on Sophon include Mind2web RL Env (Browserbase), Mind2web RL Env (Community), and ANTI BOT RL Env (Browserbase). These are RL environments, datasets, and scaffolds that target this eval.
What license is Mind2Web: Towards a Generalist Agent for the Web under?
Mind2Web: Towards a Generalist Agent for the Web is available under the MIT license.