fangyuan-ksgk / Tiny-GRPOLinks
minimal GRPO implementation from scratch
☆92Updated 4 months ago
Alternatives and similar repositories for Tiny-GRPO
Users that are interested in Tiny-GRPO are comparing it to the libraries listed below
Sorting:
- Tina: Tiny Reasoning Models via LoRA☆266Updated last month
- A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.☆112Updated 5 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)