0

Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities

Active

A benchmark for evaluating the capabilities of LLM agents in cyber offense.

Domain
Cybersecurity
License
mit
Published
Sep 2025
Notable for
Benchmark for evaluating Cybersecurity.

Cite

Notes

Only stored in your browser.

FAQ

What is Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities?
A benchmark for evaluating the capabilities of LLM agents in cyber offense.
What license is Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities under?
Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities is available under mit.