dvlab-research / MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
41 · Updated 4 months ago

Related projects

Alternatives and complementary repositories for MR-GSM8K