Sphere-AI-Lab/FormalMATH-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sphere-AI-Lab/FormalMATH-Bench)

Sphere-AI-Lab / FormalMATH-Bench

Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>

☆75

Alternatives and similar repositories for FormalMATH-Bench

Users that are interested in FormalMATH-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sphere-AI-Lab / PEFT-Arena
View on GitHub
Official repository of PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective
☆26Jun 13, 2026Updated last month
MoonshotAI / CombiBench
View on GitHub
☆52Jun 15, 2026Updated last month
Lizn-zn / NeqLIPS
View on GitHub
NeqLIPS: a powerful Olympiad-level inequality prover
☆40Sep 7, 2025Updated 10 months ago
Miracle-Messi / Isa-AutoFormal
View on GitHub
☆17Oct 27, 2024Updated last year
Sphere-AI-Lab / pion
View on GitHub
☆36Jul 2, 2026Updated 2 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RickySkywalker / LeanOfThought-Official
View on GitHub
This is the official implementation for MA-LoT.
☆20Aug 4, 2025Updated 11 months ago
pkshashank / GFLeanTransfer
View on GitHub
☆14Mar 27, 2024Updated 2 years ago
MoonshotAI / Kimina-Prover-Preview
View on GitHub
Technical report of Kimina-Prover Preview.
☆372Jul 10, 2025Updated last year
roozbeh-mohit / IMO-Steps
View on GitHub
☆31Jul 16, 2025Updated last year
loganrjmurphy / LeanEuclid
View on GitHub
LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.
☆138Nov 25, 2025Updated 7 months ago
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
Goedel-LM / Goedel-Prover
View on GitHub
☆237Apr 4, 2025Updated last year
project-numina / kimina-lean-server
View on GitHub
Kimina Lean server (+ client SDK)
☆206Jan 11, 2026Updated 6 months ago
Sphere-AI-Lab / SGP-RL
View on GitHub
Implementation of <Symbolic Graphics Programming with Large Language Models>
☆38Sep 14, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhaoyu-li / DL4TP
View on GitHub
[COLM 2024] A Survey on Deep Learning for Theorem Proving
☆227May 28, 2025Updated last year
albertqjiang / MMA
View on GitHub
The official repository for the paper Multilingual Mathematical Autoformalization
☆39May 20, 2024Updated 2 years ago
kfdong / STP
View on GitHub
The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"
☆121Mar 28, 2025Updated last year
CLR-Lab / SimKO
View on GitHub
SimKO: Simple Pass@K Policy Optimization
☆31Oct 24, 2025Updated 8 months ago
haoyuzhao123 / LeanIneqComp
View on GitHub
An inequality benchmark for theorem proving
☆22Feb 1, 2026Updated 5 months ago
Purewhite2019 / rethinking_autoformalization
View on GitHub
[ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach
☆31May 20, 2025Updated last year
trishullab / itp-interface
View on GitHub
Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…
☆19Jul 10, 2026Updated last week
ByteDance-Seed / Seed-Prover
View on GitHub
☆435Feb 13, 2026Updated 5 months ago
Sphere-AI-Lab / poet
View on GitHub
Implementation for POET and POET-X for LLM pretraining
☆38Jun 9, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fzyzcjy / ai_math_paper_list
View on GitHub
AI for Mathematics Paper List
☆17Jan 14, 2025Updated last year
multimodal-art-projection / CriticLean
View on GitHub
☆50Aug 5, 2025Updated 11 months ago
rookie-joe / PDA
View on GitHub
☆36Jan 10, 2025Updated last year
hkust-nlp / RL-Verifier-Robustness
View on GitHub
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24Oct 7, 2025Updated 9 months ago
Huawei-AI4Math / ProofFlow
View on GitHub
☆23Jun 28, 2026Updated 3 weeks ago
lean-dojo / ReProver
View on GitHub
Retrieval-Augmented Theorem Provers for Lean
☆332Jan 30, 2025Updated last year
frenzymath / herald_translator
View on GitHub
☆33Jun 12, 2025Updated last year
Sphere-AI-Lab / fda
View on GitHub
Implementation of <Model Merging with Functional Dual Anchors>
☆46Nov 23, 2025Updated 7 months ago
sgp-bench / sgp-bench
View on GitHub
☆30Jul 14, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rookie-joe / FormalAlign
View on GitHub
☆17Jul 12, 2025Updated last year
haoxiongliu / ProofAug
View on GitHub
"Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis" (ICML 2025) official implementation.
☆16Jun 8, 2025Updated last year
trishullab / clever
View on GitHub
CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning
☆46Apr 3, 2026Updated 3 months ago
yangky11 / miniF2F-lean4
View on GitHub
☆74Mar 25, 2026Updated 3 months ago
pnnl / ML4AlgComb
View on GitHub
ML Benchmarks in Algebraic Combinatorics
☆25Jan 15, 2026Updated 6 months ago
sunblaze-ucb / verina
View on GitHub
Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specificat…
☆73Apr 27, 2026Updated 2 months ago
liziniu / cold_start_rl
View on GitHub
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20Mar 9, 2025Updated last year