DSBench

DSBench is a benchmark designed to evaluate data science agents on realistic tasks. It includes 466 data analysis tasks and 74 data modeling tasks, sourced from ModelOff and Kaggle competitions.

Type: RL Env
Runtime: ORS
License: unknown
Size: 540 tasks
Published: Mar 2026

Contributors

1