sunblaze-ucb / verinaLinks
Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specification, and proof generation as well as their compositions.
☆33Updated last month
Alternatives and similar repositories for verina
Users that are interested in verina are comparing it to the libraries listed below
Sorting:
- A Machine-to-Machine Interaction System for Lean 4.☆122Updated 2 weeks ago
- [FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods☆54Updated last year
- https://albertqjiang.github.io/Portal-to-ISAbelle/☆56Updated 2 years ago
- AlphaVerus: Formally Verified Code Generation through Self-Improving Translation and Treefinement☆21Updated 6 months ago
- A Foreign Function Interface (FFI) to cvc5 solver in Lean.☆19Updated this week
- CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning☆32Updated last week
- ☆53Updated last week
- ImProver: Agent-Based Automated Proof Optimization☆39Updated last month
- Python client to interact with the lean4 language server.☆30Updated last week
- ☆23Updated 2 months ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆17Updated 2 weeks ago
- ☆22Updated 2 years ago
- (Mirror) A Machine-to-Machine Interaction System for Lean 4☆42Updated last week
- ☆64Updated last week
- Code for the paper: Proving Theorems Recursively☆12Updated last year
- Tools based on AI for helping with Lean 4☆106Updated this week
- ☆15Updated last year
- A Rocq version of the miniF2F dataset☆21Updated last month
- A Lean4 script for robustly verifying submitted proofs of theorems and implementations of functions☆24Updated 2 months ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆171Updated 2 weeks ago
- Clover: Closed-Loop Verifiable Code Generation☆37Updated 6 months ago
- ☆71Updated 2 years ago
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.☆114Updated 2 weeks ago
- ☆66Updated last month
- An inequality benchmark for theorem proving☆21Updated 6 months ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.☆184Updated last week
- Neural theorem proving evaluation via the Lean REPL☆23Updated 4 months ago
- Neural theorem proving toolkit: data extraction tools for Lean 4☆33Updated last week
- Kimina Lean server (+ client SDK)☆146Updated 3 weeks ago
- Benchmark for undergraduate-level formal mathematics☆113Updated last year