wzhouad / WPO

Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
32Updated 3 months ago

Alternatives and similar repositories for WPO:

Users that are interested in WPO are comparing it to the libraries listed below