liziniu / policy_optimizationView on GitHub
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
28Dec 19, 2023Updated 2 years ago

Alternatives and similar repositories for policy_optimization

Users that are interested in policy_optimization are comparing it to the libraries listed below

Sorting:

Are these results useful?