xiwenc1 / DRA-GRPO

Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
21 · Jan 6, 2026 · Updated last month

Alternatives and similar repositories for DRA-GRPO

Users who are interested in DRA-GRPO are comparing it to the libraries listed below.
