SimpleQAVerified
SimpleQA Verified is a 1,000-prompt benchmark for evaluating Large Language Model (LLM) short-form factuality.
- Type: RL Env
- Publisher: General Reasoning
- Runtime: ORS
- License: unknown
- Size: 0 tasks
- Published: Jan 2026
Public scores on this env
11 vf-eval reports across 1 model