snu-mllab / DPPOView on GitHub
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
42Jul 20, 2024Updated last year

Alternatives and similar repositories for DPPO

Users that are interested in DPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?