Ml Dev Bench
ML-Dev-Bench: A benchmark for testing AI agents on machine learning development tasks including model implementation, training, debugging, and optimization.
- Domain
- agent-eval
- Published
- Nov 2025
Cite
Notes
Only stored in your browser.
FAQ
- What is Ml Dev Bench?
- ML-Dev-Bench: A benchmark for testing AI agents on machine learning development tasks including model implementation, training, debugging, and optimization.