Taubench
Frontier
tau-bench challenges agents to coordinate, guide, and assist users in achieving shared objectives across complex enterprise domains.
- Domain
- rl-env
- License
- unknown
- Published
- Jan 2026
Cite
Notes
Only stored in your browser.
Top score 61.2 by GPT-4o - 12 models reporting (7 frontier)
Score history
9Top models
12FAQ
- What is Taubench?
- tau-bench challenges agents to coordinate, guide, and assist users in achieving shared objectives across complex enterprise domains.
- What is the current top score on Taubench?
- The top reported score is 61.2 by GPT-4o, across 12 models reporting (7 from frontier labs).
- What license is Taubench under?
- Taubench is available under unknown.
