Sphere-AI-Lab / FormalMATH-BenchLinks
☆51Updated this week
Alternatives and similar repositories for FormalMATH-Bench
Users that are interested in FormalMATH-Bench are comparing it to the libraries listed below
Sorting:
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆83Updated 2 months ago
- ☆27Updated last week
- ☆25Updated 9 months ago
- ☆43Updated 8 months ago
- The official repository of the Omni-MATH benchmark.☆83Updated 5 months ago
- ☆15Updated 6 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆103Updated this week
- This is the official repository for all the code of TheoremLlama☆42Updated 7 months ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆41Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆98Updated last month
- Automatic solver for plane geometry problems.☆30Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 8 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆36Updated last year
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆63Updated last year
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆70Updated 2 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆55Updated 3 months ago
- ☆69Updated 6 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆32Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆129Updated 10 months ago
- Resources for the Enigmata Project.☆32Updated this week
- Code for "Reasoning to Learn from Latent Thoughts"☆104Updated 2 months ago
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆20Updated 3 weeks ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 6 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆102Updated 4 months ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆54Updated 4 months ago
- Revisiting Mid-training in the Era of RL Scaling☆48Updated last month
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆53Updated 6 months ago
- ☆45Updated 3 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…☆19Updated 3 months ago
- ☆179Updated 2 months ago