BE LIKE RL Env (Community)
Evil AgentDojo: Training models to be MORE susceptible to prompt injections (inverted reward signal)
- Type: RL Env
- License: unknown
- Size: v0.1.3
- Published: Nov 2025
Lift evidence
| Eval | Tools known to lift | Source paper |
|---|---|---|
| AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents | BE LIKE RL Env (Community) | - |