wzhouad / WPO

Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"
29Updated last month

Related projects

Alternatives and complementary repositories for WPO