openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
492Updated last week

Related projects

Alternatives and complementary repositories for mle-bench