tongjingqi / MathTrap
In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a new dataset, MATHTRAP, by introducing carefully designed logical traps into the problem descriptions of MATH and GSM8K.
60 · Mar 15, 2025 · Updated last year

Alternatives and similar repositories for MathTrap

Users who are interested in MathTrap are comparing it to the repositories listed below.
