RLHFlow / RLHF-Reward-Modeling

Recipes for training reward models for RLHF.
903 · Updated this week

Related projects

Alternatives and complementary repositories for RLHF-Reward-Modeling