s-smits / grpo-optuna
View external linksLinks

Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
59Oct 18, 2025Updated 3 months ago

Alternatives and similar repositories for grpo-optuna

Users that are interested in grpo-optuna are comparing it to the libraries listed below

Sorting:

Are these results useful?