0

TAU 2 Synth RL Env (Community)

Fresh

τ²-bench with custom synthetic domains (library, fitness_gym, tech_support, cloud_incident_response, daily_planner, ev_charging_support)

Type
RL Env
License
apache-2.0
Published
Apr 2026

Cite

Notes

Only stored in your browser.

tau2-synth

Overview

  • Environment ID: tau2-synth
  • Short description: tau2-bench with custom synthetic domains.
  • Tags: tool-use, customer-service, multi-domain, user-simulation, synthetic
  • Source: mikasenghaas/tau2-synth (branch synth)

Domains

DomainDescription
libraryLibrary management
fitness_gymFitness gym management
tech_supportTech support
cloud_incident_responseCloud incident response
daily_plannerDaily planner
ev_charging_supportEV charging support

Setup

uv pip install -e ./environments/tau2_synth

Quickstart

uv run vf-eval tau2-synth -a '{"domain": "library"}' -d -v -n1 -r1
uv run vf-eval tau2-synth -a '{"domain": "cloud_incident_response"}' -d -v -n1 -r1

Architecture

environments/tau2_synth/
├── pyproject.toml       # depends on tau2 package from mikasenghaas/tau2-synth
├── tau2_synth.py        # environment implementation
└── README.md

Domain data is downloaded at runtime from mikasenghaas/tau2-synth (branch synth) into the default DATA_DIR resolved by the tau2 package.

Environment Arguments

ArgTypeDefaultDescription
domainstr"library"Domain to evaluate (see table above)
user_modelstr"custom_openai/openai/gpt-4.1"LLM model for user simulator
user_argsdict | NoneNoneAdditional LLM arguments for the user simulator
user_base_urlstr"https://api.pinference.ai/api/v1"Base URL for the user model
user_api_key_varstr"PRIME_API_KEY"Environment variable for the user model API key
max_stepsint200Maximum conversation steps
max_errorsint10Maximum tool execution errors before termination
max_workersint128Maximum number of workers for the thread pool

Changelog

v0.2.1 (2026-05-14)

  • Default user simulator requests now use Pinference (https://api.pinference.ai/api/v1) with PRIME_API_KEY, Prime config fallback/team-header auth, and the custom_openai/openai/gpt-4.1 model name.

v0.2.0 (Mar 30, 2026)

  • Initial release of tau2-synth with 6 synthetic domains: library, fitness_gym, tech_support, cloud_incident_response, daily_planner, ev_charging_support
  • Domain data downloaded at runtime from mikasenghaas/tau2-synth
  • User simulator with configurable model, base URL, and API key