0

SimpleQAVerified

Fresh

SimpleQA Verified is a 1,000-prompt benchmark for evaluating Large Language Model (LLM) short-form factuality.

Type
RL Env
Runtime
ORS
License
unknown
Size
0 tasks
Published
Jan 2026

Cite

Notes

Only stored in your browser.

Public scores on this env

1

1 vf-eval report across 1 model

Open the scoring view →