gersteinlab / ML-Bench

The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.09835)
355 · Updated this week

Related projects

Alternatives and complementary repositories for ML-Bench