pickxiguapi / Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
33Updated 10 months ago

Alternatives and similar repositories for Clean-Offline-RLHF:

Users that are interested in Clean-Offline-RLHF are comparing it to the libraries listed below