Mohammadjafari80 / GSM8K-RLVR

A simplified implementation for experimenting with Reinforcement Learning (RL) on GSM8K, inspired by RLVR and Deepseek R1. This repository provides a starting point for exploring RL-based reasoning.
72Updated last month

Alternatives and similar repositories for GSM8K-RLVR:

Users that are interested in GSM8K-RLVR are comparing it to the libraries listed below