superlinear-ai / microGRPOLinks

🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper
35Updated last month

Alternatives and similar repositories for microGRPO

Users that are interested in microGRPO are comparing it to the libraries listed below

Sorting: