
Sudoku Solver RL Env (Prime Intellect)

Generated environment

Type: RL Env
License: apache-2.0
Size: v0.1.1
Published: Mar 2026

sudoku-solver

This environment challenges the model to solve classic Sudoku puzzles. It tests logical deduction and constraint satisfaction over multiple turns.

Overview

Domain: games
Base Class: MultiTurnEnv
Difficulty: medium
Task: The model must fill in the empty cells of a 9x9 Sudoku grid so that each row, column, and 3x3 subgrid contains every digit from 1 to 9 exactly once.
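The task constraint can be illustrated with a small checker. This is a minimal sketch, not the environment's actual scoring code: the grid encoding (a list of nine lists of integers, with 0 for an empty cell) and the helper names `is_solved`, `respects_clues`, and `make_example_solution` are assumptions made for illustration.

```python
from typing import List

# Assumed encoding: 9 rows of 9 ints, 0 marks an empty cell.
Grid = List[List[int]]

DIGITS = set(range(1, 10))


def is_solved(grid: Grid) -> bool:
    """Return True if every row, column, and 3x3 subgrid holds 1..9 exactly once."""
    # Reject anything that is not a 9x9 grid.
    if len(grid) != 9 or any(len(row) != 9 for row in grid):
        return False
    rows = [set(row) for row in grid]
    cols = [set(col) for col in zip(*grid)]
    boxes = [
        {grid[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3)}
        for br in range(0, 9, 3)
        for bc in range(0, 9, 3)
    ]
    # Nine cells whose set equals {1..9} must contain each digit exactly once.
    return all(unit == DIGITS for unit in rows + cols + boxes)


def respects_clues(puzzle: Grid, solution: Grid) -> bool:
    """Return True if the solution keeps every pre-filled cell of the puzzle."""
    return all(
        p == 0 or p == s
        for puzzle_row, solution_row in zip(puzzle, solution)
        for p, s in zip(puzzle_row, solution_row)
    )


def make_example_solution() -> Grid:
    """Build one valid completed grid from a shifted-row pattern (illustration only)."""
    return [
        [(3 * (r % 3) + r // 3 + c) % 9 + 1 for c in range(9)]
        for r in range(9)
    ]


if __name__ == "__main__":
    solution = make_example_solution()
    puzzle = [row[:] for row in solution]
    puzzle[0][0] = 0  # blank out one cell to form a trivial puzzle
    print(is_solved(solution), is_solved(puzzle), respects_clues(puzzle, solution))
```

A complete answer would need to pass both checks: the filled grid satisfies all 27 units, and none of the original clues were overwritten.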

Quickstart

Installation

uv run vf-install sudoku-solver

Usage

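import verifiers as vf
from openai import OpenAI

# Load the installed environment by name
env = vf.load_environment("sudoku-solver")

# OpenAI() reads the API key from the OPENAI_API_KEY environment variable
results = env.evaluate_sync(
    client=OpenAI(),
    model="gpt-4.1-mini",
    num_examples=10,
    rollouts_per_example=1,
)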

Evaluation

Run an evaluation with default settings:

uv run vf-eval sudoku-solver

Configure the model (-m), number of examples (-n), rollouts per example (-r), max tokens (-t), and sampling temperature (-T):

uv run vf-eval sudoku-solver \
  -m gpt-4.1-mini \
  -n 20 -r 3 -t 1024 -T 0.7

Environment Arguments

| Arg | Type | Default | Description |
| --- | --- | --- | --- |
| num_examples | int | 1000 | Number of training examples |
| num_eval_examples | int | 100 | Number of evaluation examples |
| seed | int | 42 | Random seed for reproducibility |

Metrics

| Metric | Meaning |
| --- | --- |
| reward | Primary reward signal |
| format_reward | Format adherence reward (if applicable) |

About

Generated by synthetic-rl-env-creator.

Tags: games