ZubinGou / math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆208 Updated last year

Alternatives and similar repositories for math-evaluation-harness:

Users who are interested in math-evaluation-harness are comparing it to the libraries listed below.