Llemma formal2formal (tactic prediction) theorem proving experiments
☆20Oct 17, 2023Updated 2 years ago
Alternatives and similar repositories for llemma_formal2formal
Users that are interested in llemma_formal2formal are comparing it to the libraries listed below
Sorting:
- ☆17May 31, 2023Updated 2 years ago
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated last year
- https://albertqjiang.github.io/Portal-to-ISAbelle/☆57Sep 6, 2023Updated 2 years ago
- Python wrapper for lean-gym☆13Apr 5, 2023Updated 2 years ago
- ☆18Oct 12, 2022Updated 3 years ago
- The code of CIKM 2023 (Oral Presentation) : A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NE…☆14Jul 19, 2024Updated last year
- ☆17Jul 12, 2025Updated 7 months ago
- ☆30Dec 27, 2024Updated last year
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- The is the official implementation of "Lyra: Orchestrating Dual Correction in Automated Theorem Proving"☆15Jul 2, 2024Updated last year
- Example formalization of Game Theoretic concepts in Lean☆26Feb 14, 2025Updated last year
- ☆44Sep 19, 2024Updated last year
- ☆71Sep 30, 2023Updated 2 years ago
- Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data☆42May 29, 2024Updated last year
- ☆25Nov 19, 2025Updated 3 months ago
- llmstep: [L]LM proofstep suggestions in Lean 4.☆148Nov 11, 2023Updated 2 years ago
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- Retrieval-Augmented Theorem Provers for Lean☆317Jan 30, 2025Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- ☆25Aug 23, 2024Updated last year
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".☆73May 9, 2025Updated 10 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 6 months ago
- ☆67Oct 18, 2025Updated 4 months ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- ☆29May 8, 2024Updated last year
- A search engine for Lean 4 declarations☆54Feb 10, 2026Updated 3 weeks ago
- ☆47Aug 5, 2025Updated 7 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Nov 6, 2023Updated 2 years ago
- ☆72Apr 2, 2024Updated last year
- ☆71Oct 23, 2025Updated 4 months ago
- The official code for "GUI-ReWalk: Massive Data Generation for GUI Agent via Stochastic Exploration and Intent-Aware Reasoning"☆29Jan 28, 2026Updated last month
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- ☆83Apr 18, 2024Updated last year
- Machine learning for molecules workshop 2022☆13Nov 30, 2022Updated 3 years ago
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 7 months ago
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthemore, we provide baseline results for sentenc…☆11Jun 12, 2025Updated 8 months ago
- web programming course (COMPSCI 326, UMass Amherst)☆14Sep 13, 2022Updated 3 years ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago