GammaTauAI / WeeklyReadingsArchive
Papers read during our weekly reading group
☆12 Updated 7 months ago
Alternatives and similar repositories for WeeklyReadingsArchive
Users who are interested in WeeklyReadingsArchive are comparing it to the repositories listed below:
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions ☆41 Updated 8 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization ☆35 Updated 11 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/ ☆54 Updated last year
- LLMs + Lean, on your laptop or in the cloud ☆144 Updated 3 weeks ago
- RES-Q: Evaluating the Code-Editing Capability of Large Language Model Systems at the Repository Scale ☆26 Updated 10 months ago
- HyperTree Proof Search for Neural Theorem Proving -- "La science est l'œuvre de l'esprit humain, qui est plutôt destiné à étudier qu'à co… ☆37 Updated 8 months ago
- llmstep: [L]LM proofstep suggestions in Lean 4. ☆130 Updated last year
- COPRA: An in-COntext PRoof Agent which uses LLMs like GPTs to prove theorems in formal languages. ☆59 Updated this week
- An updated version of miniF2F with lots of fixes and informal statements / solutions. ☆83 Updated 3 months ago
- Harmonic Datasets ☆38 Updated 9 months ago
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management ☆62 Updated 3 months ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language. ☆109 Updated last week
- ☆50 Updated last month
- ☆65 Updated last year
- Formalizing stochastic doubly-efficient debate ☆102 Updated 6 months ago
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean. ☆98 Updated 10 months ago
- Tutorial on neural theorem proving ☆173 Updated last year
- NeqLIPS: a powerful Olympiad-level inequality prover ☆31 Updated 2 weeks ago
- DafnyBench: A Benchmark for Formal Software Verification ☆31 Updated 4 months ago
- A scalable abstraction learning library ☆78 Updated last year
- Clover: Closed-Loop Verifiable Code Generation ☆35 Updated 11 months ago
- Benchmark for undergraduate-level formal mathematics ☆105 Updated 6 months ago
- Neural theorem proving toolkit: data extraction tools for Lean 4 ☆25 Updated last month
- A simple REPL for Lean 4, returning information about errors and sorries. ☆113 Updated 2 weeks ago
- A Machine-to-Machine Interaction System for Lean 4. ☆74 Updated this week
- ☆13 Updated 8 months ago
- ☆27 Updated 3 years ago
- [COLM 2024] A Survey on Deep Learning for Theorem Proving ☆176 Updated 2 months ago
- A domain-specific probabilistic programming language for modeling and inference with language models ☆128 Updated last year
- General-purpose program synthesiser ☆45 Updated 6 months ago