SLIT-AI / WRPO

[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
12 · Updated 2 weeks ago

Alternatives and similar repositories for WRPO:

Users interested in WRPO are comparing it to the libraries listed below.