liziniu / policy_optimization

Code for the paper "Policy Optimization in RLHF: The Impact of Out-of-preference Data"
24 · Updated last year

Alternatives and similar repositories for policy_optimization:

Users who are interested in policy_optimization are comparing it to the libraries listed below.