shenao-zhang / reward-augmented-preference

The official implementation of Preference Data Reward-Augmentation.
14 · Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for reward-augmented-preference