goblin-questions
A minimal v1 Taskset/Harness environment for measuring whether answers include
goblin on a fixed set of open-ended preference questions.
Reward
Final reward is the weighted sum of two rewards:
reward = 0.5 * goblin_reward + 0.5 * judge_reward
goblin_reward: 1.0 if the model response containsgoblin, otherwise 0.0.judge_reward: 1.0 ifgpt-5.4-nanojudges the response coherent, relevant, and not overly repetitive.
There is no ground-truth answer. The judge sees the original prompt and the model response. The judge does not see the goblin reward.
Prompts
The taskset contains 58 plain questions. Parenthetical answer hints from the prompt brainstorming list are intentionally omitted from the model prompts.
Requirements
OPENAI_API_KEY is required for the judge.
Config
Judge and reward settings can be configured in TOML. Use the same keys under
[eval.taskset] for eval configs or [env.taskset] for RL configs:
[env.taskset]
hidden_word = "goblin"
judge_model = "gpt-5.4-nano"
judge_max_completion_tokens = 512
[env.taskset.scoring.goblin_reward]
weight = 0.5
[env.taskset.scoring.judge_reward]
weight = 0.5
Quickstart
prime env install goblin-questions
prime eval run goblin-questions