openai / mle-benchView on GitHub
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
1,329Feb 26, 2026Updated this week

Alternatives and similar repositories for mle-bench

Users that are interested in mle-bench are comparing it to the libraries listed below

Sorting:

Are these results useful?