Veri-Code / ReForm
☆38Updated 4 months ago
Alternatives and similar repositories for ReForm
Users who are interested in ReForm are comparing it to the libraries listed below.
- DafnyBench: A Benchmark for Formal Software Verification☆52Updated last year
- ☆74Updated 5 months ago
- ☆17Updated 5 months ago
- ☆133Updated 3 months ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆68Updated last year
- Reproducing R1 for Code with Reliable Rewards☆278Updated 7 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆210Updated 5 months ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆115Updated 8 months ago
- [COLM 2024] A Survey on Deep Learning for Theorem Proving☆211Updated 6 months ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆25Updated 7 months ago
- ☆15Updated last year
- Technical report of Kimina-Prover Preview.☆348Updated 5 months ago
- ☆41Updated this week
- ☆26Updated last year
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆144Updated this week
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆337Updated 2 months ago
- ☆35Updated 11 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆119Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆147Updated last year
- ☆76Updated last year
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory".☆70Updated 7 months ago
- ☆48Updated 3 months ago
- [NeurIPS 2024] MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆92Updated last year
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆78Updated last year
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆115Updated 4 months ago
- ☆220Updated 8 months ago
- ☆66Updated last month
- e☆42Updated 7 months ago
- The repository for paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆85Updated last year
- Async pipelined version of Verl☆125Updated 8 months ago