wellecks / llemma_formal2formal
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20Updated last year
Alternatives and similar repositories for llemma_formal2formal:
Users that are interested in llemma_formal2formal are comparing it to the libraries listed below
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆19Updated 3 months ago
- ☆24Updated 7 months ago
- ☆36Updated 6 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆36Updated 2 years ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated last year
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆18Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆22Updated 10 months ago
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)☆30Updated 10 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆25Updated last year
- Evaluate the Quality of Critique☆34Updated 9 months ago
- ☆29Updated 3 months ago
- This is the official repository for all the code of TheoremLlama☆39Updated 5 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆25Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆16Updated 9 months ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆23Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆34Updated 10 months ago
- ☆34Updated last year
- Repository for Skill Set Optimization☆12Updated 8 months ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆63Updated last year
- Tasks for describing differences between text distributions.☆16Updated 7 months ago
- ☆26Updated 10 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- Self-Supervised Alignment with Mutual Information☆16Updated 10 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated last month
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆26Updated 2 years ago
- ☆39Updated 2 years ago
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages.☆58Updated 3 weeks ago
- Benchmarking Benchmark Leakage in Large Language Models☆52Updated 10 months ago
- ☆27Updated last year