DS-1000

Active

1,000 data-science questions sourced from Stack Overflow, covering NumPy, Pandas, Matplotlib, PyTorch, SciPy, scikit-learn, and TensorFlow. Solutions are graded by hidden unit tests.

Open
Capabilities: Code Generation
Domain: Code
Format: Hugging Face Dataset
Size: 1,000 tasks
License: CC-BY-SA-4.0
Published: May 2026
Notable for: Evaluating data-science code generation against hidden unit tests.

Related tools

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is DS-1000?
DS-1000 is a benchmark of 1,000 data-science questions sourced from Stack Overflow, covering NumPy, Pandas, Matplotlib, PyTorch, SciPy, scikit-learn, and TensorFlow. Solutions are graded by hidden unit tests.
What capabilities does DS-1000 test?
DS-1000 evaluates code generation.
How can a model improve its DS-1000 score?
Sophon links tools that target this eval, such as RL environments, datasets, and scaffolds. Tools linked to DS-1000 include OpenEnv Jupyter Agent (E2B-backed).
What license is DS-1000 under?
DS-1000 is available under CC-BY-SA-4.0.
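
How does grading by hidden unit tests work?
As a rough illustration only, the sketch below shows the general idea of grading generated code against tests the model never sees. It is not the DS-1000 harness: the `grade` helper, the toy `column_means` task, and both candidate solutions are hypothetical and invented for this example, and a real harness would run candidates in a sandbox rather than calling `exec` directly.

```python
# Hypothetical illustration of hidden-unit-test grading; NOT the DS-1000 harness.

def grade(candidate_source: str, hidden_test_source: str) -> bool:
    """Return True if the candidate code runs and passes the hidden test."""
    namespace: dict = {}
    try:
        exec(candidate_source, namespace)    # define the generated solution
        exec(hidden_test_source, namespace)  # run assertions the model never saw
    except Exception:
        # Any error or failed assertion counts as a failed task.
        return False
    return True


# Toy task (made up): compute the column-wise mean of a list of rows.
hidden_test = """
assert column_means([[1, 2], [3, 4]]) == [2.0, 3.0]
assert column_means([[5]]) == [5.0]
"""

# A correct candidate: averages over columns.
good = """
def column_means(rows):
    return [sum(col) / len(col) for col in zip(*rows)]
"""

# An incorrect candidate: averages over rows instead of columns.
bad = """
def column_means(rows):
    return [sum(row) / len(row) for row in rows]
"""

results = {"good": grade(good, hidden_test), "bad": grade(bad, hidden_test)}
pass_rate = sum(results.values()) / len(results)  # fraction of candidates passing
```

Under these assumptions, a score is simply the fraction of tasks whose generated solution passes its hidden test.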