mrahtz / learning-from-human-preferencesLinks

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

☆325

Alternatives and similar repositories for learning-from-human-preferences

Users that are interested in learning-from-human-preferences are comparing it to the libraries listed below

Sorting:

hardmaru / WorldModelsExperiments
World Models Experiments
☆652Updated 2 years ago
lcswillems / rl-starter-files
RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code
☆697Updated last year
lilianweng / deep-reinforcement-learning-gym
Deep reinforcement learning model implementation in Tensorflow + OpenAI gym
☆299Updated 2 years ago
jachiam / rl-intro
☆136Updated 7 years ago
ctallec / world-models
Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch
☆621Updated 3 years ago
david-abel / simple_rl
A simple framework for experimenting with Reinforcement Learning in Python.
☆317Updated last year
openai / spinningup-workshop
For educational materials related to the spinning up workshops.
☆202Updated 6 years ago
uber-research / go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
☆574Updated 2 years ago
google-deepmind / dm_env
A Python interface for reinforcement learning environments
☆372Updated 2 years ago
MattChanTK / gym-maze
A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.
☆371Updated last year
Kaixhin / spinning-up-basic
Basic versions of agents from Spinning Up in Deep RL written in PyTorch
☆204Updated 4 years ago
google-research / batch_rl
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games
☆553Updated 2 years ago
openai / gym-soccer
☆303Updated 2 years ago
openai / coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
☆398Updated last year
google-research / episodic-curiosity
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆204Updated 4 years ago
Kaixhin / PlaNet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
☆371Updated 3 years ago
crowdAI / marLo
Multi Agent Reinforcement Learning using MalmÖ
☆258Updated 5 years ago
google-deepmind / dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…
☆476Updated last year
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆430Updated last year
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆203Updated 2 years ago
danijar / dreamer
Dream to Control: Learning Behaviors by Latent Imagination
☆547Updated 3 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504Updated 2 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆497Updated 2 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆360Updated 5 years ago
greydanus / baby-a3c
A high-performance Atari A3C agent in 180 lines of PyTorch
☆171Updated 4 years ago
kashif / firedup
Clone of OpenAI's Spinning Up in PyTorch
☆151Updated 3 years ago
seungjaeryanlee / awesome-rl-competitions
List of competitions related to Reinforcement Learning
☆350Updated last year
medipixel / rl_algorithms
Structural implementation of RL key algorithms
☆513Updated 2 years ago
openai / atari-py
A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface
☆384Updated 2 years ago