HellaSwag: Commonsense Event Continuation

Active

Tests a model's commonsense reasoning by asking it to select the most likely continuation of a given everyday situation.
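
The selection task described above can be sketched in a few lines of Python. This is a minimal illustration, not the benchmark's official harness: the item layout, the function names `pick_ending` and `accuracy`, the sample item, and the toy word-overlap scorer are all assumptions made for the example. A real evaluation would likely replace the toy scorer with a model's log-likelihood for each candidate ending.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical item layout: (context, candidate endings, index of the correct ending).
Item = Tuple[str, Sequence[str], int]
Scorer = Callable[[str, str], float]


def pick_ending(context: str, endings: Sequence[str], score: Scorer) -> int:
    """Return the index of the highest-scoring ending (first one wins on ties)."""
    scores = [score(context, ending) for ending in endings]
    return max(range(len(endings)), key=scores.__getitem__)


def accuracy(items: Sequence[Item], score: Scorer) -> float:
    """Fraction of items where the chosen ending matches the labelled one."""
    correct = sum(
        pick_ending(context, endings, score) == label
        for context, endings, label in items
    )
    return correct / len(items)


def toy_overlap_score(context: str, ending: str) -> float:
    """Stand-in scorer (assumption): counts ending words that also occur in the context."""
    context_words = set(context.lower().split())
    return float(sum(word in context_words for word in ending.lower().split()))


# Made-up example item, not taken from the dataset.
ITEMS = [
    (
        "A man pours water into a kettle and sets the kettle on the stove.",
        [
            "He turns on the stove and waits for the kettle to boil.",
            "He paints a fence bright red.",
            "He dives into a swimming pool.",
            "He folds a paper airplane.",
        ],
        0,
    ),
]

if __name__ == "__main__":
    print(accuracy(ITEMS, toy_overlap_score))
```

Passing the scorer as a function keeps the selection and accuracy logic independent of any particular model, so the same loop could be reused with a different scoring rule.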

Domain
Reasoning
License
MIT
Published
May 2026
Notable for
Benchmark for evaluating commonsense reasoning.

Related tools

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is HellaSwag: Commonsense Event Continuation?
It is a benchmark that tests a model's commonsense reasoning by asking it to select the most likely continuation of a given everyday situation.
How can a model improve its HellaSwag: Commonsense Event Continuation score?
Tools linked to HellaSwag: Commonsense Event Continuation on Sophon include Hellaswag RL Env (Community). Linked tools are RL environments, datasets, and scaffolds that target this eval.
What license is HellaSwag: Commonsense Event Continuation under?
HellaSwag: Commonsense Event Continuation is available under the MIT license.