AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats
Active
Evaluates LLM agents against the OWASP Top 10 for Agentic Applications (2026), measuring both task utility and security resilience across memory poisoning, autonomy hijacking, and data exfiltration scenarios.
- Domain
- Safeguards
- License
- mit
- Published
- May 2026
- Notable for
- Benchmark for evaluating Safeguards.
Cite
Notes
Only stored in your browser.
FAQ
- What is AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats?
- Evaluates LLM agents against the OWASP Top 10 for Agentic Applications (2026), measuring both task utility and security resilience across memory poisoning, autonomy hijacking, and data exfiltration scenarios.
- What license is AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats under?
- AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats is available under mit.