ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆ 273 · Apr 26, 2024 · Updated last year

Alternatives and similar repositories for math-evaluation-harness

Users who are interested in math-evaluation-harness compare it to the libraries listed below.

