0

BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions

Active

Reading comprehension dataset that queries for complex, non-factoid information, and require difficult entailment-like inference to solve.

Domain
Reasoning
License
mit
Published
May 2026
Notable for
Benchmark for evaluating Reasoning.

Cite

Notes

Only stored in your browser.

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions?
Reading comprehension dataset that queries for complex, non-factoid information, and require difficult entailment-like inference to solve.
How can a model improve its BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions score?
Tools linked to BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions on Sophon include Boolq RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions under?
BoolQ: Exploring the Surprising Difficulty of Natural Yes/No Questions is available under mit.