openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
517Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for mle-bench