trishullab / PutnamBench
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
☆86Updated last week
Alternatives and similar repositories for PutnamBench:
Users that are interested in PutnamBench are comparing it to the libraries listed below
- A Machine-to-Machine Interaction System for Lean 4.☆51Updated 3 weeks ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆103Updated 2 weeks ago
- ☆40Updated last week
- Catalog Of Math Problems Formalized In Lean☆131Updated this week
- [COLM 2024] A Survey on Deep Learning for Theorem Proving☆167Updated last week
- Benchmark for undergraduate-level formal mathematics☆100Updated 4 months ago
- Proof artifact co-training for Lean☆42Updated 2 years ago
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.☆84Updated 8 months ago
- An updated version of miniF2F with lots of fixes and informal statements / solutions.☆72Updated last month
- ☆64Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆33Updated 9 months ago
- Proof recording for Lean 3☆26Updated 3 years ago
- ProofNet dataset ported into Lean 4☆19Updated 9 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/☆53Updated last year
- llmstep: [L]LM proofstep suggestions in Lean 4.☆125Updated last year
- ☆30Updated 3 months ago
- White-box automation for Lean 4☆238Updated this week
- A static analysis tool for Lean 4.☆63Updated this week
- Isabelle REPL☆19Updated this week
- Retrieval-Augmented Theorem Provers for Lean☆253Updated 3 weeks ago
- ImProver: Agent-Based Automated Proof Optimization☆23Updated this week
- ☆40Updated last month
- ☆27Updated 3 years ago
- Interactive neural theorem proving in Lean☆117Updated 2 years ago
- Formalizing Euclidean Geometry in Lean☆29Updated 11 months ago
- Lean mathzoo☆24Updated 2 years ago
- LLMs + Lean, on your laptop or in the cloud☆134Updated 3 months ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages.☆55Updated this week
- Neural theorem proving evaluation via the Lean REPL☆17Updated 4 months ago
- Experiments in automation for Lean☆92Updated last week