rookie-joe / FormalAlignLinks
☆17Updated 6 months ago
Alternatives and similar repositories for FormalAlign
Users that are interested in FormalAlign are comparing it to the libraries listed below
Sorting:
- ☆35Updated last year
- ☆25Updated last year
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆41Updated last year
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆67Updated last year
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆16Updated last year
- ☆16Updated last year
- ☆74Updated 2 weeks ago
- Collections of RLxLM experiments using minimal codes☆14Updated 11 months ago
- The official repository of the Omni-MATH benchmark.☆93Updated last year
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24Updated last year
- ☆30Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆76Updated 3 months ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆24Updated 8 months ago
- ☆13Updated last year
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆117Updated 9 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Updated 2 years ago
- Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"☆19Updated 2 years ago
- [ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist☆34Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆38Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆150Updated last year
- ☆36Updated last year
- ☆142Updated 4 months ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Updated 3 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆64Updated last year
- Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"☆37Updated 2 months ago
- ☆71Updated 2 years ago
- ☆16Updated last year
- ☆40Updated last month
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆29Updated last year
- Neural theorem proving evaluation via the Lean REPL☆23Updated 6 months ago