RLHFlow / RLHF-Reward-Modeling

Recipes to train reward models for RLHF.
1,437 · Updated 4 months ago

Alternatives and similar repositories for RLHF-Reward-Modeling

Users who are interested in RLHF-Reward-Modeling are comparing it to the libraries listed below.
