tongjingqi / MathTrapView on GitHub
In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a new dataset MATHTRAP‡ by introducing carefully designed logical traps into the problem descriptions of MATH and GSM8K.
60Mar 15, 2025Updated 11 months ago

Alternatives and similar repositories for MathTrap

Users that are interested in MathTrap are comparing it to the libraries listed below

Sorting:

Are these results useful?