wellecks / llemma_formal2formal
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20 · Updated last year
Alternatives and similar repositories for llemma_formal2formal:
Users who are interested in llemma_formal2formal are comparing it to the repositories listed below
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆22 · Updated 10 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models ☆37 · Updated 2 years ago
- ☆25 · Updated 7 months ago
- This is the official repository for all the code of TheoremLlama ☆40 · Updated 6 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆26 · Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving" ☆18 · Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization ☆35 · Updated 10 months ago
- ☆37 · Updated 6 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs ☆34 · Updated last year
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings. ☆25 · Updated last year
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data ☆40 · Updated 10 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023) ☆31 · Updated 11 months ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09… ☆20 · Updated 4 months ago
- AI for Mathematics Paper List ☆17 · Updated 3 months ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages. ☆58 · Updated 2 weeks ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023] ☆66 · Updated last year
- The instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 · Updated 7 months ago
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen… ☆26 · Updated last year
- Evaluate the Quality of Critique ☆34 · Updated 10 months ago
- This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving" ☆16 · Updated 9 months ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023) ☆23 · Updated last year
- ☆34 · Updated last year
- ☆29 · Updated 3 months ago
- ☆27 · Updated last year
- Supporting code for ReCEval paper ☆28 · Updated 7 months ago
- Scratchpad/Chain-of-Thought Prompts ☆12 · Updated 2 years ago
- Minimum Description Length probing for neural network representations ☆19 · Updated 2 months ago
- ☆95 · Updated last year
- Harmonic Datasets ☆37 · Updated 9 months ago
- Benchmarking Benchmark Leakage in Large Language Models ☆51 · Updated 10 months ago