sarahmart / HARDMath

A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low accuracy in solving these problems.
β˜†13Updated last month

Alternatives and similar repositories for HARDMath:

Users that are interested in HARDMath are comparing it to the libraries listed below