superlinear-ai / microGRPO

🐭 A tiny single-file implementation of Group Relative Policy Optimization (GRPO) as introduced by the DeepSeekMath paper
☆38 Updated 3 months ago

Alternatives and similar repositories for microGRPO

Users who are interested in microGRPO are comparing it to the libraries listed below.
