mrahtz / learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
307Updated 2 years ago

Related projects

Alternatives and complementary repositories for learning-from-human-preferences