shenao-zhang / reward-augmented-preference

The official implementation of Preference Data Reward-Augmentation.

Related projects

Alternatives and complementary repositories for reward-augmented-preference