Mohammadjafari80 / GSM8K-RLVRLinks
A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.
☆117Updated 5 months ago
Alternatives and similar repositories for GSM8K-RLVR
Users that are interested in GSM8K-RLVR are comparing it to the libraries listed below
Sorting:
- Tina: Tiny Reasoning Models via LoRA☆272Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval.☆242Updated 8 months ago
- Reproducible, flexible LLM evaluations