A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆282 Sep 25, 2025 Updated 5 months ago
Alternatives and similar repositories for DeepMath
Users who are interested in DeepMath are comparing it to the repositories listed below.
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆741 Jun 6, 2025 Updated 8 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆221 Nov 27, 2025 Updated 3 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 Oct 7, 2025 Updated 4 months ago
- ☆1,098 Jan 10, 2026 Updated last month
- [COLM 2025] An Open Math Pre-training Dataset with 370B Tokens. ☆109 Apr 4, 2025 Updated 10 months ago
- Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling ☆472 May 17, 2025 Updated 9 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts" ☆40 May 22, 2025 Updated 9 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,215 Aug 27, 2025 Updated 6 months ago
- Official Repo for Open-Reasoner-Zero ☆2,084 Jun 2, 2025 Updated 8 months ago
- ☆33 Nov 18, 2025 Updated 3 months ago
- Reproducing R1 for Code with Reliable Rewards ☆288 May 5, 2025 Updated 9 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning" ☆184 May 20, 2025 Updated 9 months ago
- ☆813 Jun 9, 2025 Updated 8 months ago
- ☆331 May 31, 2025 Updated 8 months ago
- ☆762 Dec 23, 2025 Updated 2 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆744 Jun 6, 2025 Updated 8 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆243 Jul 13, 2025 Updated 7 months ago
- Democratizing Reinforcement Learning for LLMs ☆5,135 Feb 20, 2026 Updated last week
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆53 Dec 13, 2025 Updated 2 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ☆23 Mar 18, 2025 Updated 11 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models ☆72 Feb 25, 2025 Updated last year
- Evaluation of LLMs on latest math competitions ☆222 Feb 20, 2026 Updated last week
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification) ☆31 Aug 8, 2025 Updated 6 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect… ☆134 Jan 31, 2026 Updated 3 weeks ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆172 Nov 4, 2025 Updated 3 months ago
- ☆74 Jun 28, 2025 Updated 7 months ago
- ☆29 Jan 23, 2024 Updated 2 years ago
- Simple RL training for reasoning ☆3,829 Dec 23, 2025 Updated 2 months ago
- a-m-team's exploration in large language modeling ☆194 May 29, 2025 Updated 8 months ago
- Scalable RL solution for advanced reasoning of language models ☆1,806 Mar 18, 2025 Updated 11 months ago
- A Sober Look at Language Model Reasoning ☆93 Nov 18, 2025 Updated 3 months ago
- instruction-following benchmark for large reasoning models ☆44 Aug 9, 2025 Updated 6 months ago
- Recipes to train the self-rewarding reasoning LLMs. ☆231 Mar 2, 2025 Updated 11 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆3,559 Feb 15, 2026 Updated last week
- All-in-one benchmarking platform for evaluating LLM. ☆15 Nov 12, 2025 Updated 3 months ago
- ☆17 Apr 9, 2025 Updated 10 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH ☆26 Dec 23, 2024 Updated last year
- Technical report of Kimina-Prover Preview. ☆361 Jul 10, 2025 Updated 7 months ago
- ☆352 Jul 29, 2025 Updated 6 months ago