s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
59 · Oct 18, 2025 · Updated 4 months ago

Alternatives and similar repositories for grpo-optuna

Users who are interested in grpo-optuna are comparing it to the libraries listed below.
