Question 1

What is Agent Dojo?

Accepted Answer

Benchmark for agent robustness against prompt injection attacks in tool-use scenarios

Question 2

What is the current top score on Agent Dojo?

Accepted Answer

The top reported score is 100.0% by GPT-4.1 Mini, across 2 models reporting (2 from frontier labs).

Question 3

How can a model improve its Agent Dojo score?

Accepted Answer

Tools linked to Agent Dojo on Sophon include Agent DOJO RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Agent Dojo under?

Accepted Answer

Agent Dojo is available under unknown.

Agent Dojo

Score history