Question 1

What is AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents?

Accepted Answer

Assesses whether AI agents can be hijacked by malicious third parties using prompt injections in simple environments such as a workspace or travel booking app.

Question 2

How can a model improve its AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents score?

Accepted Answer

Tools linked to AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents on Sophon include Agent DOJO RL Env (Prime Community), Agent DOJO RL Env (Prime Intellect), CASA House RL Env (Community), BE LIKE RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 3

What license is AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents under?

Accepted Answer

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents is available under mit.

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

Related tools

Agent DOJO RL Env (Prime Community)

Agent DOJO RL Env (Prime Intellect)

CASA House RL Env (Community)

BE LIKE RL Env (Community)

FAQ