jinpz / dtv
The official code release for Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization
☆27 · Updated last week
Alternatives and similar repositories for dtv:
Users who are interested in dtv are comparing it to the repositories listed below:
- ☆27 · Updated 2 months ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries ☆58 · Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization ☆34 · Updated 10 months ago
- ☆64 · Updated last year
- ☆32 · Updated 4 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/ ☆53 · Updated last year
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data ☆39 · Updated 9 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models ☆36 · Updated last year
- This is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving" ☆14 · Updated 8 months ago
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆22 · Updated 9 months ago
- ☆24 · Updated 6 months ago
- ☆48 · Updated last month
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023] ☆62 · Updated last year
- DafnyBench: A Benchmark for Formal Software Verification ☆25 · Updated 3 months ago
- Harmonic Datasets ☆36 · Updated 8 months ago
- Tutorial on neural theorem proving ☆166 · Updated last year
- [COLM 2024] A Survey on Deep Learning for Theorem Proving ☆171 · Updated last month
- An updated version of miniF2F with lots of fixes and informal statements / solutions. ☆77 · Updated 2 months ago
- ProofNet dataset ported into Lean 4 ☆19 · Updated 10 months ago
- Neural theorem proving tutorial, version II ☆34 · Updated 10 months ago
- ☆14 · Updated 7 months ago
- ☆83 · Updated last month
- SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023) ☆48 · Updated 8 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆138 · Updated 5 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆103 · Updated 11 months ago
- Official code for paper: INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving ☆39 · Updated 2 years ago
- An environment for learning formal mathematical reasoning from scratch ☆65 · Updated 7 months ago
- llmstep: [L]LM proofstep suggestions in Lean 4. ☆126 · Updated last year