multimodal-art-projection / CriticLeanView external linksLinks
☆47Aug 5, 2025Updated 6 months ago
Alternatives and similar repositories for CriticLean
Users that are interested in CriticLean are comparing it to the libraries listed below
Sorting:
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- ☆17Jul 12, 2025Updated 7 months ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆27May 20, 2025Updated 8 months ago
- StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion☆23Aug 19, 2025Updated 5 months ago
- ☆35Jan 10, 2025Updated last year
- Benchmark for undergraduate-level formal mathematics☆116Oct 14, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ☆12Mar 27, 2024Updated last year
- ☆20Oct 31, 2025Updated 3 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.☆212Jan 11, 2026Updated last month
- ☆17May 31, 2023Updated 2 years ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆117Mar 28, 2025Updated 10 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Sep 19, 2025Updated 4 months ago
- ☆14Oct 11, 2023Updated 2 years ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- ☆30Dec 27, 2024Updated last year
- A static analysis tool for Lean 4.☆113Updated this week
- Solving Inequality Proofs with Large Language Models.☆56Dec 15, 2025Updated 2 months ago
- ☆16Oct 27, 2024Updated last year
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆18Jan 16, 2026Updated 3 weeks ago
- ☆25Jun 10, 2025Updated 8 months ago
- ☆28Jun 12, 2025Updated 8 months ago
- ☆25Dec 13, 2024Updated last year
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- A Foreign Function Interface (FFI) to cvc5 solver in Lean.☆23Jan 27, 2026Updated 2 weeks ago
- ☆33Jul 15, 2025Updated 6 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆22Mar 18, 2025Updated 10 months ago
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆29May 8, 2025Updated 9 months ago
- ☆46Jun 11, 2025Updated 8 months ago
- Kimina Lean server (+ client SDK)☆179Jan 11, 2026Updated last month
- ☆25Nov 19, 2025Updated 2 months ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics☆35Aug 15, 2025Updated 6 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆32Aug 13, 2025Updated 6 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 5 months ago
- Syntax for searching with natural language from Lean, using https://leansearch.net/ (may extend to other services)☆29Updated this week
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆101Feb 20, 2025Updated 11 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated last year