xiwenc1 / DRA-GRPOView on GitHub
Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
23Jan 6, 2026Updated 2 months ago

Alternatives and similar repositories for DRA-GRPO

Users that are interested in DRA-GRPO are comparing it to the libraries listed below

Sorting:

Are these results useful?