What license is APE: Attempt to Persuade Eval under?

APE: Attempt to Persuade Eval is available under mit.

APE: Attempt to Persuade Eval

Active

Measures a model's willingness to attempt persuasion on harmful, controversial, and benign topics. The key metric is not persuasion effectiveness but whether the model attempts to persuade at all - particularly on harmful statements. Uses a multi-model

Open

Publisher: FAR.AI
Domain: Safeguards
License: mit
Published: Mar 2026
Notable for: Benchmark for evaluating Safeguards.
Canonical: github.com/UKGovernmentBEIS/inspect_evals/tree/main/src/inspect_evals/ape

Cite

Notes

Only stored in your browser.

Attribution

README: github.com/UKGovernmentBEIS/inspect_evals/blob/main/src/inspect_evals/ape/README.mdMIT

Attribution policy →

FAQ

What is APE: Attempt to Persuade Eval?: Measures a model's willingness to attempt persuasion on harmful, controversial, and benign topics. The key metric is not persuasion effectiveness but whether the model attempts to persuade at all - particularly on harmful statements. Uses a multi-model
What license is APE: Attempt to Persuade Eval under?: APE: Attempt to Persuade Eval is available under mit.