LeapLabTHU / JustGRPO
Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".
133 · Apr 3, 2026 · Updated last week

Alternatives and similar repositories for JustGRPO

Users who are interested in JustGRPO are comparing it to the libraries listed below.
