sarahmart / HARDMathView on GitHub
A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low accuracy in solving these problems.
26Feb 14, 2025Updated last year

Alternatives and similar repositories for HARDMath

Users that are interested in HARDMath are comparing it to the libraries listed below

Sorting:

Are these results useful?