openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.
589 · Updated this week

Alternatives and similar repositories for mle-bench:

Users who are interested in mle-bench are comparing it to the libraries listed below.