0

Binary Audit

An open-source benchmark for evaluating AI agents' ability to find backdoors hidden in compiled binaries.

Domain
agent-eval
Published
Dec 2025

Cite

Notes

Only stored in your browser.

FAQ

What is Binary Audit?
An open-source benchmark for evaluating AI agents' ability to find backdoors hidden in compiled binaries.