pickxiguapi / Clean-Offline-RLHF

Offline RLHF codebase for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR 2024)
31 · Updated 7 months ago
