s-smits / grpo-optunaLinks
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated 2 months ago
Alternatives and similar repositories for grpo-optuna
Users that are interested in grpo-optuna are comparing it to the libraries listed below
Sorting: