☆487 · Jul 22, 2024 · Updated last year
Alternatives and similar repositories for aimo-progress-prize
Users that are interested in aimo-progress-prize are comparing it to the libraries listed below.
- ☆85 · Jul 10, 2024 · Updated last year
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆121 · Dec 10, 2024 · Updated last year
- [COLM 2024] A Survey on Deep Learning for Theorem Proving ☆219 · May 28, 2025 · Updated 9 months ago
- State-of-the-art bilingual open-sourced Math reasoning LLMs. ☆544 · Oct 22, 2024 · Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization ☆38 · May 20, 2024 · Updated last year
- ☆342 · Jun 5, 2025 · Updated 9 months ago
- Technical report of Kimina-Prover Preview. ☆364 · Jul 10, 2025 · Updated 8 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆152 · Jul 12, 2024 · Updated last year
- AI for Mathematics Paper List ☆17 · Jan 14, 2025 · Updated last year
- The official repository of the Omni-MATH benchmark. ☆93 · Dec 22, 2024 · Updated last year
- ☆412 · Feb 13, 2026 · Updated last month
- A simple REPL for Lean 4, returning information about errors and sorries. ☆192 · Mar 10, 2026 · Updated 2 weeks ago
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆1,114 · Feb 22, 2024 · Updated 2 years ago
- Kimina Lean server (+ client SDK) ☆187 · Jan 11, 2026 · Updated 2 months ago
- ☆17 · Jul 12, 2025 · Updated 8 months ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving" ☆117 · Mar 28, 2025 · Updated 11 months ago
- Code & data for ICLR 2024 spotlight paper: 🎯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data ☆42 · May 29, 2024 · Updated last year
- ☆85 · Jan 25, 2025 · Updated last year
- LLMs + Lean, on your laptop or in the cloud ☆204 · Oct 10, 2025 · Updated 5 months ago
- ☆72 · Sep 30, 2023 · Updated 2 years ago
- Catalog Of Math Problems Formalized In Lean ☆238 · Mar 19, 2026 · Updated last week
- [NeurIPS D&B 2024] Generative AI for Math: MathPile ☆420 · Apr 4, 2025 · Updated 11 months ago
- This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving" ☆15 · Jul 2, 2024 · Updated last year
- ☆1,113 · Jan 10, 2026 · Updated 2 months ago
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models> ☆77 · Jan 8, 2026 · Updated 2 months ago
- The MATH Dataset (NeurIPS 2021) ☆1,328 · Sep 6, 2025 · Updated 6 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆38 · Jul 25, 2024 · Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆273 · Apr 26, 2024 · Updated last year
- ☆1,033 · Dec 17, 2024 · Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. ☆329 · Jan 29, 2026 · Updated last month
- Benchmark for undergraduate-level formal mathematics ☆117 · Oct 14, 2024 · Updated last year
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs" ☆392 · Jan 19, 2025 · Updated last year
- ☆30 · Dec 27, 2024 · Updated last year
- ☆229 · Apr 4, 2025 · Updated 11 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆27 · Dec 23, 2024 · Updated last year
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset ☆112 · May 22, 2025 · Updated 10 months ago
- A Machine-to-Machine Interaction System for Lean 4. ☆136 · Feb 24, 2026 · Updated last month
- Formal to Formal Mathematics Benchmark ☆420 · Aug 16, 2023 · Updated 2 years ago
- ☆25 · Aug 23, 2024 · Updated last year