☆47Aug 5, 2025Updated 7 months ago
Alternatives and similar repositories for CriticLean
Users that are interested in CriticLean are comparing it to the libraries listed below
Sorting:
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- ☆17Jul 12, 2025Updated 7 months ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆27May 20, 2025Updated 9 months ago
- StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion☆24Aug 19, 2025Updated 6 months ago
- ☆35Jan 10, 2025Updated last year
- Benchmark for undergraduate-level formal mathematics☆117Oct 14, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ☆21Oct 31, 2025Updated 4 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.☆217Feb 22, 2026Updated last week
- ☆13Mar 27, 2024Updated last year
- ☆17May 31, 2023Updated 2 years ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆117Mar 28, 2025Updated 11 months ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated last week
- ☆14Oct 11, 2023Updated 2 years ago
- A static analysis tool for Lean 4.☆114Updated this week
- Solving Inequality Proofs with Large Language Models.☆58Dec 15, 2025Updated 2 months ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- ☆16Oct 27, 2024Updated last year
- ☆25Jun 10, 2025Updated 8 months ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆18Feb 19, 2026Updated 2 weeks ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 5 months ago
- ☆25Dec 13, 2024Updated last year
- ☆28Jun 12, 2025Updated 8 months ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- ☆33Jul 15, 2025Updated 7 months ago
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆29May 8, 2025Updated 9 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated 11 months ago
- Formalization of IMO shortlist problems in Lean 4☆25Updated this week
- A Foreign Function Interface (FFI) to cvc5 solver in Lean.☆24Feb 16, 2026Updated 2 weeks ago
- ☆46Jun 11, 2025Updated 8 months ago
- Kimina Lean server (+ client SDK)☆183Jan 11, 2026Updated last month
- ☆25Nov 19, 2025Updated 3 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆33Aug 13, 2025Updated 6 months ago
- Neural theorem proving evaluation via the Lean REPL☆23Jul 12, 2025Updated 7 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 6 months ago
- A simple REPL for Lean 4, returning information about errors and sorries.☆191Feb 25, 2026Updated last week