sunblaze-ucb/verina

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sunblaze-ucb/verina)

sunblaze-ucb / verina

Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specification, and proof generation as well as their compositions.

☆74

Alternatives and similar repositories for verina

Users that are interested in verina are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

trishullab / clever
View on GitHub
CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning
☆47Apr 3, 2026Updated 3 months ago
kfdong / STP
View on GitHub
The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"
☆122Mar 28, 2025Updated last year
Lizn-zn / NeqLIPS
View on GitHub
NeqLIPS: a powerful Olympiad-level inequality prover
☆40Sep 7, 2025Updated 10 months ago
wiio12 / POETRY
View on GitHub
Code for the paper: Proving Theorems Recursively
☆12May 23, 2024Updated 2 years ago
trishullab / itp-interface
View on GitHub
Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…
☆19Jul 10, 2026Updated 2 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
JOSHCLUNE / LeanHammer
View on GitHub
LeanHammer is an automated reasoning tool for Lean that brings together multiple proof search and reconstruction techniques and combines …
☆97Jul 16, 2026Updated last week
Huawei-AI4Math / ProofFlow
View on GitHub
☆23Jun 28, 2026Updated last month
Beneficial-AI-Foundation / vericoding
View on GitHub
tools and benchmarks for verified coding
☆27Jun 5, 2026Updated last month
microsoft / DSP-Plus
View on GitHub
Implementation and subsequent optimization for "Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models"
☆28Jun 16, 2025Updated last year
haoxiongliu / ProofAug
View on GitHub
"Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis" (ICML 2025) official implementation.
☆16Jun 8, 2025Updated last year
stanford-centaur / PyPantograph
View on GitHub
A Machine-to-Machine Interaction System for Lean 4.
☆145Jun 30, 2026Updated 3 weeks ago
logsem / gitrees
View on GitHub
guarded interaction trees
☆14Jul 6, 2026Updated 3 weeks ago
arthurpaulino / NumLean
View on GitHub
A Lean 4 package for heavy numerical computations
☆20Jan 16, 2022Updated 4 years ago
verse-lab / sisyphus
View on GitHub
Mostly Automated Proof Repair for Verified Libraries
☆16Jun 1, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / wybecoder
View on GitHub
WybeCoder Verified Generation of Imperative Code with LLMs
☆36May 6, 2026Updated 2 months ago
leanprover / lean-eval-leaderboard
View on GitHub
Results for the lean-eval benchmark (https://github.com/leanprover/lean-eval)
☆17Updated this week
albertqjiang / INT
View on GitHub
Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving
☆40Dec 12, 2022Updated 3 years ago
Miracle-Messi / Isa-AutoFormal
View on GitHub
☆17Oct 27, 2024Updated last year
Sphere-AI-Lab / FormalMATH-Bench
View on GitHub
Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>
☆75Jan 8, 2026Updated 6 months ago
BartoszPiotrowski / lean-premise-selection
View on GitHub
☆22Jan 14, 2026Updated 6 months ago
chuanyang-Zheng / Lyra-theorem-prover
View on GitHub
The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"
☆15Jul 2, 2024Updated 2 years ago
kim-em / lean-training-data
View on GitHub
☆62Dec 1, 2025Updated 7 months ago
leanprover-community / lean-auto
View on GitHub
Experiments on automation for Lean
☆180Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
insait-institute / open-proof-corpus
View on GitHub
This repository contains the code for the paper The Open Proof Corpus: Building a Large-Scale, Human-Validated Dataset of LLM-Generated P…
☆18Aug 4, 2025Updated 11 months ago
Veri-Code / ReForm
View on GitHub
☆43May 7, 2026Updated 2 months ago
liuchengwucn / FIMO
View on GitHub
☆38Jun 30, 2026Updated 3 weeks ago
peregrine-project / peregrine-tool
View on GitHub
Verified compiler from LambdaBox to WebAssembly, C, Rust, and OCaml
☆25Jul 21, 2026Updated last week
loganrjmurphy / LeanEuclid
View on GitHub
LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean.
☆139Nov 25, 2025Updated 8 months ago
Purewhite2019 / rethinking_autoformalization
View on GitHub
[ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach
☆31May 20, 2025Updated last year
haoyuzhao123 / LeanIneqComp
View on GitHub
An inequality benchmark for theorem proving
☆22Feb 1, 2026Updated 5 months ago
zhaoyu-li / DL4TP
View on GitHub
[COLM 2024] A Survey on Deep Learning for Theorem Proving
☆228May 28, 2025Updated last year
dwrensha / compfiles
View on GitHub
Catalog Of Math Problems Formalized In Lean
☆249Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
leanprover-community / plausible
View on GitHub
☆109Jul 15, 2026Updated 2 weeks ago
project-numina / kimina-lean-server
View on GitHub
Kimina Lean server (+ client SDK)
☆206Jan 11, 2026Updated 6 months ago
Beneficial-AI-Foundation / vericoding-benchmark
View on GitHub
☆42Jun 5, 2026Updated last month
KellyJDavis / goedels-poetry
View on GitHub
A recursive, reflective POETRY algorithm variant using Goedel-Prover-V2
☆34Mar 9, 2026Updated 4 months ago
LLM4Rocq / miniF2F-rocq
View on GitHub
A Rocq version of the miniF2F dataset
☆26Updated this week
kim-em / lean-zip
View on GitHub
☆109Updated this week
augustepoiroux / LeanInteract
View on GitHub
LeanInteract: A Python Interface for Lean 4
☆126Jul 17, 2026Updated last week