PremiLab-Math/MathCheck

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PremiLab-Math/MathCheck)

PremiLab-Math / MathCheck

[ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

☆34

Alternatives and similar repositories for MathCheck

Users that are interested in MathCheck are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ning-mz / SCA-GPS
View on GitHub
Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems
☆16Dec 27, 2023Updated 2 years ago
felixludos / alphageometry
View on GitHub
☆13Oct 10, 2024Updated last year
hkust-nlp / Laser
View on GitHub
[ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆66May 22, 2025Updated last year
ChengpengLi1003 / DotaMath
View on GitHub
☆30Dec 27, 2024Updated last year
majianz / dl4gps
View on GitHub
[ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"
☆36Sep 14, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wxjiao / InstructMT
View on GitHub
A collection of instruction data and scripts for machine translation.
☆20Sep 23, 2023Updated 2 years ago
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆26Jul 28, 2025Updated 11 months ago
zhaoyu-li / PyEuclid
View on GitHub
[CAV 2025] PyEuclid: A Versatile Formal Plane Geometry System in Python
☆15Jun 27, 2025Updated last year
MARIO-Math-Reasoning / Super_MARIO
View on GitHub
☆341Jun 5, 2025Updated last year
QwenLM / ProcessBench
View on GitHub
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆189May 20, 2025Updated last year
VITA-Group / Data-Efficient-Scaling
View on GitHub
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
☆14Jan 4, 2024Updated 2 years ago
iiis-ai / IterativeQuestionComposing
View on GitHub
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)
☆23Oct 2, 2025Updated 9 months ago
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
ZubinGou / math-evaluation-harness
View on GitHub
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆277Apr 26, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chen-judge / UniGeo
View on GitHub
[EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression
☆34Dec 7, 2022Updated 3 years ago
tengxiaoliu / XoT
View on GitHub
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆27Nov 4, 2023Updated 2 years ago
OpenBMB / OlympiadBench
View on GitHub
[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…
☆195Jun 8, 2025Updated last year
conceptmath / conceptmath
View on GitHub
[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …
☆26May 29, 2024Updated 2 years ago
bin123apple / MACM
View on GitHub
[NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
☆94Jul 24, 2024Updated last year
THUKElab / DR-CSC
View on GitHub
The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"
☆21Nov 17, 2023Updated 2 years ago
tpgh24 / ag4masses
View on GitHub
Making Google Deepmind's AlphaGeometry accessible to the Masses
☆63Jan 9, 2025Updated last year
protagolabs / odyssey-math
View on GitHub
☆84Jan 25, 2025Updated last year
MARIO-Math-Reasoning / MARIO
View on GitHub
☆28May 8, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hanquansanren / DvD
View on GitHub
[SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…
☆33Mar 10, 2026Updated 4 months ago
stepfun-ai / StepFun-Formalizer
View on GitHub
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion
☆29Aug 19, 2025Updated 11 months ago
stepfun-ai / StepFun-Prover-Preview
View on GitHub
Large language models designed for formal theorem proving through tool-integrated reasoning.
☆33Aug 13, 2025Updated 11 months ago
hkust-nlp / dart-math
View on GitHub
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120Dec 10, 2024Updated last year
chujiezheng / LLM-Extrapolation
View on GitHub
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75May 20, 2025Updated last year
GAIR-NLP / ReasonEval
View on GitHub
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80Oct 9, 2025Updated 9 months ago
FreedomIntelligence / OVM
View on GitHub
☆74Apr 2, 2024Updated 2 years ago
kyegomez / Reka-Torch
View on GitHub
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28Jul 13, 2026Updated last week
jinpz / dtv
View on GitHub
The official code release for Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization
☆38Mar 9, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
allenai / Lila
View on GitHub
A unified benchmark for math reasoning
☆90Jan 25, 2023Updated 3 years ago
pkshashank / GFLeanTransfer
View on GitHub
☆14Mar 27, 2024Updated 2 years ago
dxhou / CoAct
View on GitHub
☆32Jul 8, 2024Updated 2 years ago
NJU-RL / GLIDER
View on GitHub
[ICML 2025] Official Implementation of GLIDER
☆73Oct 9, 2025Updated 9 months ago
THUDM / ChatGLM-Math
View on GitHub
☆82Apr 18, 2024Updated 2 years ago
RUCAIBox / JiuZhang3.0
View on GitHub
The code and data for the paper JiuZhang3.0
☆49May 26, 2024Updated 2 years ago
fzyzcjy / ai_math_paper_list
View on GitHub
AI for Mathematics Paper List
☆17Jan 14, 2025Updated last year