0

AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats

Active

Evaluates LLM agents against the OWASP Top 10 for Agentic Applications (2026), measuring both task utility and security resilience across memory poisoning, autonomy hijacking, and data exfiltration scenarios.

Domain
Safeguards
License
mit
Published
May 2026
Notable for
Benchmark for evaluating Safeguards.

Cite

Notes

Only stored in your browser.

FAQ

What is AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats?
Evaluates LLM agents against the OWASP Top 10 for Agentic Applications (2026), measuring both task utility and security resilience across memory poisoning, autonomy hijacking, and data exfiltration scenarios.
What license is AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats under?
AgentThreatBench: Evaluating LLM Agent Resilience to OWASP Agentic Threats is available under mit.