A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆287Sep 25, 2025Updated 5 months ago
Alternatives and similar repositories for DeepMath
Users who are interested in DeepMath are comparing it to the libraries listed below
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆742Jun 6, 2025Updated 9 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆222Nov 27, 2025Updated 3 months ago
- ☆1,111Jan 10, 2026Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,225Aug 27, 2025Updated 6 months ago
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆32Aug 8, 2025Updated 7 months ago
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.☆110Apr 4, 2025Updated 11 months ago
- Official Repo for Open-Reasoner-Zero☆2,085Jun 2, 2025Updated 9 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 9 months ago
- Democratizing Reinforcement Learning for LLMs☆5,219Mar 13, 2026Updated last week
- ☆332May 31, 2025Updated 9 months ago
- Reproducing R1 for Code with Reliable Rewards☆295May 5, 2025Updated 10 months ago
- ☆761Dec 23, 2025Updated 2 months ago
- ☆810Jun 9, 2025Updated 9 months ago
- ☆34Nov 18, 2025Updated 4 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆476May 17, 2025Updated 10 months ago
- ☆76Jun 28, 2025Updated 8 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆187May 20, 2025Updated 10 months ago
- Evaluation of LLMs on latest math competitions☆229Mar 10, 2026Updated last week
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆73Feb 25, 2025Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆34Nov 11, 2025Updated 4 months ago
- Scalable RL solution for advanced reasoning of language models☆1,813Mar 18, 2025Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 3 months ago
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 8 months ago
- ☆29Jan 23, 2024Updated 2 years ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆745Jun 6, 2025Updated 9 months ago
- Simple RL training for reasoning☆3,834Dec 23, 2025Updated 2 months ago
- ☆74Jun 10, 2025Updated 9 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆254Jul 13, 2025Updated 8 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated last month
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆175Nov 4, 2025Updated 4 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆424Jul 11, 2025Updated 8 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆569May 6, 2025Updated 10 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆23Mar 18, 2025Updated last year
- a-m-team's exploration in large language modeling☆194May 29, 2025Updated 9 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- All-in-one benchmarking platform for evaluating LLMs.☆15Nov 12, 2025Updated 4 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆64Jul 8, 2024Updated last year
- Recipes to train the self-rewarding reasoning LLMs.☆231Mar 2, 2025Updated last year