☆85Jan 25, 2025Updated last year
Alternatives and similar repositories for odyssey-math
Users that are interested in odyssey-math are comparing it to the libraries listed below
Sorting:
- List of awesome works that use AI for mathematical discoveries.☆31Feb 21, 2026Updated last week
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 5 months ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- [ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist☆35Oct 23, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆42Jul 19, 2024Updated last year
- The Lean Theorem Proving Environment☆14May 7, 2023Updated 2 years ago
- ☆43Sep 19, 2024Updated last year
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆120Dec 10, 2024Updated last year
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆184Jun 8, 2025Updated 8 months ago
- A unified benchmark for math reasoning☆90Jan 25, 2023Updated 3 years ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆77Oct 9, 2025Updated 4 months ago
- Kimina Lean server (+ client SDK)☆183Jan 11, 2026Updated last month
- ☆30Dec 27, 2024Updated last year
- The FATE (Formal Algebra Theorem Evaluation) benchmarks.☆42Feb 23, 2026Updated last week
- Language models scale reliably with over-training and on downstream tasks☆100Apr 2, 2024Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆150Jul 12, 2024Updated last year
- Solving Inequality Proofs with Large Language Models.☆58Dec 15, 2025Updated 2 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Mar 24, 2023Updated 2 years ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆273Apr 26, 2024Updated last year
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 9 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86May 21, 2025Updated 9 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆33Aug 13, 2025Updated 6 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year
- ☆76Jan 8, 2026Updated last month
- ☆58Sep 2, 2024Updated last year
- ☆26Jul 16, 2025Updated 7 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…☆26Feb 14, 2025Updated last year
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- ☆130Jul 8, 2024Updated last year
- A formal proof of the irrationality of zeta(3), the Apéry constant [maintainer=@amahboubi,@pi8027]☆25Updated this week
- ☆26Nov 1, 2021Updated 4 years ago
- LLMs + Lean, on your laptop or in the cloud☆202Oct 10, 2025Updated 4 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆71Jul 13, 2025Updated 7 months ago
- Mathport is a tool for porting Lean3 projects to Lean4☆44Nov 21, 2024Updated last year