pickxiguapi / Clean-Offline-RLHF

Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR 2024)
35 · Updated 11 months ago

Alternatives and similar repositories for Clean-Offline-RLHF:

Users who are interested in Clean-Offline-RLHF are comparing it to the libraries listed below.