fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆97 Updated 6 months ago
Alternatives and similar repositories for Tiny-GRPO
Users who are interested in Tiny-GRPO are comparing it to the libraries listed below.
- A simplified implementation for experimenting with RLVR on GSM8K. This repository provides a starting point for exploring reasoning. ☆125 Updated 7 months ago
- Tina: Tiny Reasoning Models via LoRA ☆282 Updated last month
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)