SLIT-AI / WRPO

[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
13Updated last month

Alternatives and similar repositories for WRPO:

Users that are interested in WRPO are comparing it to the libraries listed below