cmu-l3 / minictx-eval

Neural theorem proving evaluation via the Lean REPL

☆23

Alternatives and similar repositories for minictx-eval

Users who are interested in minictx-eval are comparing it to the libraries listed below.

Purewhite2019 / rethinking_autoformalization
[ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach
☆24 Updated 7 months ago
Lizn-zn / NeqLIPS
NeqLIPS: a powerful Olympiad-level inequality prover
☆39 Updated 4 months ago
albertqjiang / draft_sketch_prove
☆71 Updated 2 years ago
trishullab / copra
COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages.
☆69 Updated last month
roozbeh-yz / IMO-Steps
☆25 Updated 5 months ago
kim-em / lean-training-data
☆54 Updated last month
trishullab / itp-interface
Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…
☆17 Updated this week
albertqjiang / Portal-to-ISAbelle
https://albertqjiang.github.io/Portal-to-ISAbelle/
☆56 Updated 2 years ago
zhaoyu-li / DL4TP
[COLM 2024] A Survey on Deep Learning for Theorem Proving
☆213 Updated 7 months ago
rookie-joe / PDA
☆35 Updated last year
stanford-centaur / PyPantograph
A Machine-to-Machine Interaction System for Lean 4.
☆129 Updated 2 weeks ago
riyazahuja / ImProver
ImProver: Agent-Based Automated Proof Optimization
☆39 Updated this week
loganrjmurphy / LeanEuclid
LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.
☆122 Updated last month
chuanyang-Zheng / Lyra-theorem-prover
This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"
☆16 Updated last year
rahul3613 / ProofNet-lean4
ProofNet dataset ported into Lean 4
☆27 Updated 7 months ago
cmu-l3 / ntp-toolkit
Neural theorem proving toolkit: data extraction tools for Lean 4
☆34 Updated 3 weeks ago
trishullab / PutnamBench
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
☆193 Updated last week
facebookresearch / miniF2F
An updated version of miniF2F with lots of fixes and informal statements / solutions.
☆97 Updated last year
yangky11 / miniF2F-lean4
☆67 Updated 2 months ago
albertqjiang / MMA
The official repository for the paper Multilingual Mathematical Autoformalization
☆38 Updated last year
haoyuzhao123 / LeanIneqComp
An inequality benchmark for theorem proving
☆21 Updated 7 months ago
liuchengwucn / FIMO
☆36 Updated last year
trishullab / clever
CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning
☆34 Updated 3 weeks ago
augustepoiroux / LeanInteract
LeanInteract: A Python Interface for Lean 4
☆94 Updated last month
Miracle-Messi / Isa-AutoFormal
☆15 Updated last year
leanprover-community / repl
A simple REPL for Lean 4, returning information about errors and sorries.
☆179 Updated last month
zhangir-azerbayev / ProofNet
Benchmark for undergraduate-level formal mathematics
☆113 Updated last year
pkshashank / GFLeanTransfer
☆12 Updated last year
project-numina / kimina-lean-server
Kimina Lean server (+ client SDK)
☆164 Updated this week
wellecks / llmstep
llmstep: [L]LM proofstep suggestions in Lean 4.
☆145 Updated 2 years ago