sarahmart / HARDMath

A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low accuracy in solving these problems.
β˜†17Updated 3 months ago

Alternatives and similar repositories for HARDMath

Users that are interested in HARDMath are comparing it to the libraries listed below

Sorting: