wellecks / llemma_formal2formalLinks
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20Updated 2 years ago
Alternatives and similar repositories for llemma_formal2formal
Users that are interested in llemma_formal2formal are comparing it to the libraries listed below
Sorting:
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆38Updated 2 years ago
- ☆26Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19Updated 2 years ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆22Updated 2 months ago
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆23Updated last year
- e☆42Updated 7 months ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆41Updated last year
- This is the official repository for all the code of TheoremLlama☆47Updated 4 months ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆78Updated last year
- Solving Inequality Proofs with Large Language Models.☆57Updated last month
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆37Updated last year
- ☆42Updated last year
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆16Updated last year
- ☆17Updated 5 months ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages.☆67Updated 2 weeks ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆40Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆125Updated last year
- Codebase for Inference-Time Policy Adapters☆24Updated 2 years ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆38Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆154Updated 3 months ago
- ☆120Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 3 years ago
- Evaluate the Quality of Critique☆36Updated last year
- AI for Mathematics Paper List☆17Updated 10 months ago
- SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)☆51Updated last year
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning☆29Updated 4 years ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆28Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Updated last year
- ☆73Updated 4 months ago