DIFF Bench RL Env (Community)

Benchmark for evaluating agents on Slack, Linear, Box, and Calendar via Bash and Python.

Type: RL Env
Runtime: multi-turn
License: unknown
Size: v0.1.16
Published: Feb 2026
