rookie-joe / automatic-lean4-compilation
☆13 Updated 3 months ago
Related projects
Alternatives and complementary repositories for automatic-lean4-compilation
- ☆18 Updated last week
- ☆15 Updated this week
- ☆51 Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆95 Updated 2 months ago
- ☆35 Updated 9 months ago
- ☆63 Updated 5 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆102 Updated 2 months ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries ☆53 Updated 8 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆118 Updated 3 weeks ago
- Safety-J: Evaluating Safety with Critique ☆13 Updated 3 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆82 Updated 4 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆52 Updated last week
- 🌾 OAT: Online AlignmenT for LLMs ☆27 Updated this week
- ☆24 Updated 2 weeks ago
- Explore what LLMs are really learning over SFT ☆26 Updated 7 months ago
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆21 Updated 3 weeks ago
- Collection of papers for scalable automated alignment. ☆72 Updated 3 weeks ago
- ☆98 Updated 5 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆96 Updated 6 months ago
- Evaluating Mathematical Reasoning Beyond Accuracy ☆37 Updated 7 months ago
- ☆10 Updated 3 months ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data ☆36 Updated 5 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. ☆189 Updated 3 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆125 Updated last year
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings) ☆29 Updated 10 months ago
- ☆75 Updated last month
- Rewarded soups official implementation ☆50 Updated last year
- Code for ACL2024 paper - Adversarial Preference Optimization (APO). ☆49 Updated 5 months ago
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707 ☆23 Updated last year
- [EMNLP 22] UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression ☆26 Updated last year