AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions

Active

Evaluates abstention in LLMs across 20 diverse datasets, including questions with unknown answers, underspecification, false premises, subjective interpretations, and outdated information.

Domain
Safeguards
License
MIT
Published
Sep 2025
Notable for
Benchmark for evaluating Safeguards.

FAQ

What is AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions?
AbstentionBench is a benchmark that evaluates abstention in LLMs across 20 diverse datasets, including questions with unknown answers, underspecification, false premises, subjective interpretations, and outdated information.
What license is AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions under?
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions is available under the MIT license.