s-smits / grpo-optunaLinks

Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
53Updated 3 months ago

Alternatives and similar repositories for grpo-optuna

Users that are interested in grpo-optuna are comparing it to the libraries listed below

Sorting: