A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆291Sep 25, 2025Updated 6 months ago
Alternatives and similar repositories for DeepMath
Users that are interested in DeepMath are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆744Jun 6, 2025Updated 10 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆224Nov 27, 2025Updated 4 months ago
- ☆1,122Jan 10, 2026Updated 2 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,239Aug 27, 2025Updated 7 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆32Aug 8, 2025Updated 8 months ago
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆110Apr 4, 2025Updated last year
- Reproducing R1 for Code with Reliable Rewards☆302May 5, 2025Updated 11 months ago
- Official Repo for Open-Reasoner-Zero☆2,087Jun 2, 2025Updated 10 months ago
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 10 months ago
- Democratizing Reinforcement Learning for LLMs☆5,363Updated this week
- ☆334May 31, 2025Updated 10 months ago
- ☆763Dec 23, 2025Updated 3 months ago
- ☆813Jun 9, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆36Nov 18, 2025Updated 4 months ago
- ☆77Jun 28, 2025Updated 9 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆478May 17, 2025Updated 10 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆189May 20, 2025Updated 10 months ago
- Evaluation of LLMs on latest math competitions☆239Mar 28, 2026Updated last week
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆35Nov 11, 2025Updated 4 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆73Feb 25, 2025Updated last year
- Scalable RL solution for advanced reasoning of language models☆1,837Mar 18, 2025Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 9 months ago
- ☆29Jan 23, 2024Updated 2 years ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆747Jun 6, 2025Updated 10 months ago
- Simple RL training for reasoning☆3,846Dec 23, 2025Updated 3 months ago
- ☆73Jun 10, 2025Updated 9 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆257Jul 13, 2025Updated 8 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated 2 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆180Nov 4, 2025Updated 5 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆431Jul 11, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆568May 6, 2025Updated 11 months ago
- a-m-team's exploration in large language modeling☆195May 29, 2025Updated 10 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 4 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆64Jul 8, 2024Updated last year
- Recipes to train the self-rewarding reasoning LLMs.☆231Mar 2, 2025Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year