Wargames RL Env (Layerbrain)

Long-horizon real-time game environments for Prime Intellect Verifiers.

Type: RL Env
Publisher: Layerbrain
License: unknown
Size: v0.1.12
Published: Apr 2026

WarGames For Prime RL

This package publishes WarGames as a Prime Intellect Verifiers environment. Prime starts an episode, receives the current game frame, calls the normal computer-control tools, and receives the next frame plus the reward scored by WarGames.

This environment uses the same game catalog, mission IDs, reward profiles, and trusted scoring probes as the local runner.

# from the WarGames repo root
uv pip install -e ./environments/prime

Run any shipped eval config:

prime eval run wargames \
  --config environments/prime/configs/doom/eval-map01.toml \
  -n 1 -r 1

Supported Games

| Game | Missions | Eval config | RL config |
| --- | --- | --- | --- |
| Red Alert | 297 | configs/redalert/eval-soviet-01.toml | configs/redalert/rl-soviet-01.toml |
| FlightGear | 14 | configs/flightgear/eval-c172p-takeoff.toml | configs/flightgear/rl-c172p-takeoff.toml |
| SuperTuxKart | 63 | configs/supertuxkart/eval-lighthouse.toml | configs/supertuxkart/rl-lighthouse.toml |
| 0 A.D. | 390 | configs/zeroad/eval-arcadia.toml | configs/zeroad/rl-arcadia.toml |
| Freeciv | 12 | configs/freeciv/eval-earth-small.toml | configs/freeciv/rl-earth-small.toml |
| Doom | 204 | configs/doom/eval-map01.toml | configs/doom/rl-map01.toml |
| SuperTux | 321 | configs/supertux/eval-welcome-antarctica.toml | configs/supertux/rl-welcome-antarctica.toml |
| Mindustry | 27 | configs/mindustry/eval-veins.toml | configs/mindustry/rl-veins.toml |
| Craftium | 96 | configs/craftium/eval-chop-tree.toml | configs/craftium/rl-chop-tree.toml |
| IKEMEN GO | 9 | configs/ikemen/eval-kfm.toml | configs/ikemen/rl-kfm.toml |
| Open Surge | 33 | configs/opensurge/eval-sunshine-1.toml | configs/opensurge/rl-sunshine-1.toml |
| Quaver | 70 | configs/quaver/eval-crossover-beginner.toml | configs/quaver/rl-crossover-beginner.toml |
| Naev | 128 | configs/naev/eval-tutorial.toml | configs/naev/rl-tutorial.toml |

The full game docs live under ../../docs/games.
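
The shipped config paths in the table follow a visible pattern: a per-game directory under configs/, then an eval- or rl- prefix on a shared scenario slug. A minimal sketch of that naming pattern is below. The helper name `config_paths` is hypothetical and not part of the WarGames package, and the pattern is inferred from the table rather than documented.

```python
from pathlib import PurePosixPath


def config_paths(game_dir: str, slug: str) -> dict:
    """Build the eval and RL config paths for one game directory and slug.

    Hypothetical helper: the naming pattern is inferred from the table of
    shipped configs, not taken from WarGames code.
    """
    base = PurePosixPath("configs") / game_dir
    return {
        "eval": str(base / f"eval-{slug}.toml"),
        "rl": str(base / f"rl-{slug}.toml"),
    }


paths = config_paths("doom", "map01")
print(paths["eval"])
print(paths["rl"])
```

These paths are relative to environments/prime, as in the `prime eval run` command above.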

Config Format

Prime configs select the WarGames task:

game = "redalert"
mission = "redalert.soviet-01.normal"
reward_profile = "protective"
recorder_mode = "none"
max_steps = 500
rollouts_per_example = 8

| Field | Meaning |
| --- | --- |
| game | WarGames game ID. |
| mission | Mission ID from scenarios/<game>/missions/. |
| reward_profile | Reward profile ID from scenarios/<game>/profiles/. |
| recorder_mode | none, summary_only, or full. Use none for high-throughput RL. |
| max_steps | Episode step budget. Budget stops are marked as truncated. |
| rollouts_per_example | Prime rollout count for training configs. |

Every game ships a standard reward profile. Red Alert also ships dense, protective, terminal, speedrun, and aggressive_stress_test.

Reward profile schema and per-game fields: ../../docs/reward_profiles.md.

Runtime Contract

For each turn:

  1. Prime sends a model response.
  2. WarGames reads the latest computer-control tool call.
  3. WarGames applies it to the live game.
  4. WarGames returns a JSON status block, the latest frame, and the updated reward.

The model sees rendered game output and the public tool/action history. Trusted game state stays inside WarGames and is only used by the reward profile.
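
The per-turn contract above can be sketched with a stub in place of a live game. Everything in this block is hypothetical: the class, the field names, and the shape of the returned dictionary are assumptions for illustration, and the reward is a fixed placeholder because real scoring depends on trusted game state that the model never sees.

```python
from dataclasses import dataclass, field


@dataclass
class StubGame:
    """Stand-in for a live game; every name here is hypothetical."""

    max_steps: int
    steps: int = 0
    history: list = field(default_factory=list)  # public tool/action history

    def apply(self, tool_call: dict) -> dict:
        """Apply one tool call and return status, frame, and reward."""
        self.steps += 1
        self.history.append(tool_call["name"])
        truncated = self.steps >= self.max_steps  # step budget exhausted
        return {
            "status": {"step": self.steps, "truncated": truncated},
            "frame": f"<frame {self.steps}>",  # placeholder for a rendered frame
            "reward": 0.0,  # placeholder; a reward profile would score trusted state
        }


def run_episode(game: StubGame, tool_calls: list[dict]) -> list[dict]:
    """Feed tool calls to the game one turn at a time until the budget stops it."""
    results = []
    for call in tool_calls:
        result = game.apply(call)
        results.append(result)
        if result["status"]["truncated"]:
            break
    return results


calls = [{"name": "wait"}, {"name": "key_down"}, {"name": "key_up"}, {"name": "wait"}]
game = StubGame(max_steps=3)
results = run_episode(game, calls)
print(len(results), results[-1]["status"])
```

With a budget of three steps, the fourth call is never applied, and the last result carries the truncated flag, which mirrors how the max_steps field is described in the config table.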

Supported tool calls are the normal WarGames computer controls:

move_mouse
mouse_down
mouse_up
key_down
key_up
scroll
wait
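
Step 2 of the runtime contract says WarGames reads the latest computer-control tool call. One way to picture that selection is sketched below, assuming each tool call is a dictionary with a `name` key. That shape, the helper name, and the choice to skip unsupported names rather than reject them are all assumptions. The source does not specify tool arguments or error handling.

```python
from typing import Optional

# The seven control names listed above.
SUPPORTED_TOOLS = frozenset(
    {"move_mouse", "mouse_down", "mouse_up", "key_down", "key_up", "scroll", "wait"}
)


def latest_supported_call(tool_calls: list[dict]) -> Optional[dict]:
    """Return the last call whose name is a supported control, or None.

    Hypothetical helper: skipping unsupported names is an assumption.
    """
    for call in reversed(tool_calls):
        if call.get("name") in SUPPORTED_TOOLS:
            return call
    return None


response_calls = [
    {"name": "move_mouse"},
    {"name": "scroll"},
    {"name": "open_browser"},  # not in the supported list, so it is skipped
]
print(latest_supported_call(response_calls))
```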

Publishing

The Prime package bundles the WarGames runtime and mission catalog into the published wheel. Publish after main contains the runtime changes:

make -C environments/prime publish

The publish target runs the WarGames harness/evaluation tests, Prime conformance tests, a Prime source archive install check, and then pushes the public layerbrain/wargames environment.