zwhong714 / weak-to-strong-preference-optimization

[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
16 | Feb 24, 2025 | Updated 11 months ago

Alternatives and similar repositories for weak-to-strong-preference-optimization

Users that are interested in weak-to-strong-preference-optimization are comparing it to the libraries listed below
