rookie-joe / automatic-lean4-compilation
☆15Updated 6 months ago
Alternatives and similar repositories for automatic-lean4-compilation:
Users that are interested in automatic-lean4-compilation are comparing it to the libraries listed below
- ☆26Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆115Updated 5 months ago
- ☆23Updated last week
- GenRM-CoT: Data release for verification rationales☆47Updated 4 months ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆58Updated 11 months ago
- ☆64Updated 10 months ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆39Updated 8 months ago
- ☆64Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆114Updated 7 months ago
- [COLM 2024] A Survey on Deep Learning for Theorem Proving☆167Updated last week
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆135Updated 4 months ago
- Explore what LLMs are really leanring over SFT☆28Updated 10 months ago
- ☆130Updated 2 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆121Updated 3 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆106Updated 5 months ago
- ☆35Updated this week
- The official repository of the Omni-MATH benchmark.☆71Updated last month
- Rewarded soups official implementation☆55Updated last year
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆173Updated 9 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆48Updated 2 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆289Updated 6 months ago
- ☆54Updated 3 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆123Updated last month
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆20Updated 11 months ago
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆14Updated 7 months ago
- ☆93Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆62Updated 3 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆29Updated 8 months ago
- ☆41Updated 3 months ago
- ☆40Updated last month