zwhong714 / weak-to-strong-preference-optimizationLinks

[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
13Updated 4 months ago

Alternatives and similar repositories for weak-to-strong-preference-optimization

Users that are interested in weak-to-strong-preference-optimization are comparing it to the libraries listed below

Sorting: