Veri-Code / ReFormLinks
☆25Updated last month
Alternatives and similar repositories for ReForm
Users that are interested in ReForm are comparing it to the libraries listed below
Sorting:
- ☆63Updated last month
- ☆25Updated last year
- ☆96Updated last week
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆107Updated 5 months ago
- ☆30Updated 3 weeks ago
- e☆39Updated 4 months ago
- ☆56Updated 2 months ago
- Solving Inequality Proofs with Large Language Models.☆44Updated last week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 9 months ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆41Updated 2 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…☆21Updated 6 months ago
- ☆282Updated last month
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆67Updated last year
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆190Updated this week
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆101Updated 3 weeks ago
- ☆74Updated 9 months ago
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆32Updated 2 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆110Updated 3 months ago
- This is the official repository for all the code of TheoremLlama☆44Updated 3 weeks ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆153Updated last month
- A repo for open research on building large reasoning models☆94Updated this week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated 3 months ago
- DafnyBench: A Benchmark for Formal Software Verification☆44Updated 8 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆61Updated 6 months ago
- ☆41Updated 11 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 7 months ago
- GenRM-CoT: Data release for verification rationales☆65Updated 10 months ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆111Updated last week
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆77Updated 5 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆116Updated 5 months ago