pickxiguapi / Clean-Offline-RLHF
Offline RLHF codebase for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR 2024)
41 · Mar 26, 2024 · Updated 2 years ago

Alternatives and similar repositories for Clean-Offline-RLHF

Users who are interested in Clean-Offline-RLHF are comparing it to the libraries listed below.
