opencode-lean
OpenCode Lean 4 theorem proving environment via ComposableEnv.
Overview
- Environment ID:
opencode-lean - Tags: lean, theorem-proving, multi-turn, sandbox
Quickstart
uv run vf-eval opencode-lean -n1 -r1 -d -v
Architecture
Uses ComposableEnv with LeanTaskSet + opencode_harness:
- Agent gets
bashandedittools - Proof file at
/tmp/proof.leanwithsorryplaceholder - System prompt instructs compile-iterate workflow
- Scoring by
LeanRubric(checksstate["lean_compiled"])
Dataset Presets
| Preset | Dataset |
|---|---|
deepseek-prover-v1 | DeepSeek-Prover-V1 |
minif2f | MiniF2F |
goedel-pset | Goedel PSet |
Changelog
v0.3.10
- Bump
verifiersto>=0.1.15.dev2for the OpenCode harness config that disables title-generation calls while preserving thesmall_modelpin.
v0.3.9
- Harden sandbox image bootstrap against transient Ubuntu archive mirror sync flakes by adding apt acquire retries.
v0.3.8
- Fix
sandbox_docker_imageprefix. Thecme8364tg000o1139v84cu0cv/...prefix carried over from v0.3.7 is a user-scoped ID that the cluster cannot pull from, causingImagePullBackOffon every sandbox creation. Swap to the team-scopedteam-clyvldofb0000gg1kx39rgzjq/opencode-lean:rl2.
v0.3.7
- Pin
sandbox_docker_imagedefault toteam-clyvldofb0000gg1kx39rgzjq/opencode-lean:rl2. The new image bakes the opencode v1.1.63-rl2 binary into the sandbox so cold sandboxes no longer need to install it at rollout time. README updated to document the change.
v0.3.5
- Add
sandbox_docker_imageargument (defaultteam-clyvldofb0000gg1kx39rgzjq/opencode-lean:rl2), threaded through toLeanTaskSet(#305). Companion to #303 which handled math/cp/science.
v0.3.4
- Bump opencode fork release from
1.1.63-rl1to1.1.63-rl2(PrimeIntellect-ai/opencode#3), explicitly pinned via theopencode_release_versionoverride. Fork release surfaces session-level retry exhaustion as a non-zero exit with a structured stderr dump, so hosted RL rollouts that previously returned silent empty trajectories now produce realAgentErrorentries. Companion default bump in verifiers: PrimeIntellect-ai/verifiers#1184.
v0.3.3
- Bump verifiers to stable
>=0.1.12.
v0.3.2
- Unpin
prime-sandboxesgit source override; use PyPI release>=0.2.19. - Bump verifiers to
>=0.1.13.dev1.
v0.2.1
- Migrate OpenCode fork from
rasdani/opencodetoPrimeIntellect-ai/opencode. Bump release from1.1.63-swe8to1.1.63-rl1via sharedopencode_harnessdefaults (trimmed system prompt for RL training efficiency).
v0.2.0
- Rewrite to composable architecture. Uses
ComposableEnv+LeanTaskSet+opencode_harness. Replaceslean_codeenvironment.
v0.1.0
- Initial release