Question 1

What is BlicketTest CausalReasoning?

Accepted Answer

Multi-turn causal reasoning environment where an LLM explores a Blicket-detecting machine to identify which objects activate it under a hidden rule

Question 2

What is the current top score on BlicketTest CausalReasoning?

Accepted Answer

The top reported score is 62.0% by Qwen3 30B A3B Instruct 2507, across 3 models reporting (1 from frontier labs).

Question 3

How can a model improve its BlicketTest CausalReasoning score?

Accepted Answer

Tools linked to BlicketTest CausalReasoning on Sophon include Blickettest Causalreasoning RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is BlicketTest CausalReasoning under?

Accepted Answer

BlicketTest CausalReasoning is available under unknown.

BlicketTest CausalReasoning

Score history

Top models

Related tools

Blickettest Causalreasoning RL Env (Community)

FAQ