ZubinGou / math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐Ÿงฎโœจ
โ˜†145Updated 8 months ago

Alternatives and similar repositories for math-evaluation-harness:

Users that are interested in math-evaluation-harness are comparing it to the libraries listed below