stepfun-ai / StepFun-Prover-PreviewLinks
Large language models designed for formal theorem proving through tool-integrated reasoning.
☆25Updated last week
Alternatives and similar repositories for StepFun-Prover-Preview
Users that are interested in StepFun-Prover-Preview are comparing it to the libraries listed below
Sorting:
- ☆63Updated last month
- ☆126Updated 3 months ago
- ☆80Updated 6 months ago
- Technical report of Kimina-Prover Preview.☆323Updated last month
- ☆41Updated 11 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆113Updated 2 months ago
- ☆51Updated 2 months ago
- ☆41Updated 3 weeks ago
- ☆86Updated 3 weeks ago
- Solving Inequality Proofs with Large Language Models.☆42Updated this week
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆48Updated this week
- ☆275Updated 3 weeks ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆53Updated 9 months ago
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems☆172Updated this week
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆43Updated last month
- ☆237Updated 2 months ago
- ☆55Updated last month
- ☆90Updated 2 weeks ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆36Updated 2 months ago
- ☆69Updated 2 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizer☆160Updated 2 months ago
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆108Updated last month
- Physics of Language Models, Part 4☆232Updated 3 weeks ago
- ☆22Updated 2 months ago
- ☆30Updated last week
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆51Updated 3 weeks ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆86Updated 4 months ago
- ☆71Updated last week
- Kinetics: Rethinking Test-Time Scaling Laws☆76Updated last month
- ☆69Updated 10 months ago