Binary Audit
An open-source benchmark for evaluating AI agents' ability to find backdoors hidden in compiled binaries.
- Domain: agent-eval
- Published: Dec 2025