ZubinGou / math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐Ÿงฎโœจ
โ˜†94Updated 6 months ago

Related projects โ“˜

Alternatives and complementary repositories for math-evaluation-harness