trishullab / PutnamBench
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
☆140 Updated last week
Alternatives and similar repositories for PutnamBench
Users who are interested in PutnamBench are comparing it to the repositories listed below.
- [COLM 2024] A Survey on Deep Learning for Theorem Proving ☆195 Updated last month
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean. ☆101 Updated 2 months ago
- ☆60 Updated last week
- Benchmark for undergraduate-level formal mathematics ☆108 Updated 9 months ago
- ☆34 Updated 8 months ago
- A simple REPL for Lean 4, returning information about errors and sorries. ☆133 Updated last week
- ☆48 Updated 5 months ago
- Retrieval-Augmented Theorem Provers for Lean ☆279 Updated 5 months ago
- Catalog Of Math Problems Formalized In Lean ☆176 Updated this week
- ☆67 Updated last year
- An updated version of miniF2F with lots of fixes and informal statements / solutions. ☆87 Updated 6 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/ ☆56 Updated last year
- llmstep: [L]LM proofstep suggestions in Lean 4. ☆137 Updated last year
- A Machine-to-Machine Interaction System for Lean 4. ☆102 Updated this week
- A "code interpreter" for Lean ☆60 Updated this week
- LLMs + Lean, on your laptop or in the cloud ☆168 Updated last month
- NeqLIPS: a powerful Olympiad-level inequality prover ☆37 Updated 2 months ago
- Kimina Lean server ☆90 Updated this week
- Proof artifact co-training for Lean ☆45 Updated 2 years ago
- Neural theorem proving evaluation via the Lean REPL ☆23 Updated 8 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization ☆36 Updated last year
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo… ☆16 Updated 2 months ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages. ☆62 Updated 2 months ago
- An inequality benchmark for theorem proving ☆15 Updated last month
- ☆31 Updated 6 months ago
- ☆18 Updated last month
- ProofNet dataset ported into Lean 4 ☆22 Updated last month
- Proof recording for Lean 3 ☆27 Updated 3 years ago
- ImProver: Agent-Based Automated Proof Optimization ☆33 Updated last week
- Code for the paper: Proving Theorems Recursively ☆12 Updated last year