s-smits / grpo-optuna

Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
39Updated last month

Alternatives and similar repositories for grpo-optuna:

Users that are interested in grpo-optuna are comparing it to the libraries listed below