Question 1

What is Tau Bench Env?

Accepted Answer

τ-bench: Tool-Agent-User benchmark for conversational agents in customer service domains with user simulation

Question 2

What is the current top score on Tau Bench Env?

Accepted Answer

The top reported score is 100.0% by GPT-4.1 Mini, across 1 model reporting (1 from frontier labs).

Question 3

How can a model improve its Tau Bench Env score?

Accepted Answer

Tools linked to Tau Bench Env on Sophon include Bench ENV RL Env (Prime Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Tau Bench Env under?

Accepted Answer

Tau Bench Env is available under mit.

Tau Bench Env

Top models