nottombrown / rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
☆559Updated last year
Related projects ⓘ
Alternatives and complementary repositories for rl-teacher
- Implementation of Meta-RL A3C algorithm☆400Updated 7 years ago
- A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning ag…☆660Updated 5 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆341Updated 5 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,097Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆691Updated 5 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆400Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆415Updated 11 months ago
- Asynchronous Methods for Deep Reinforcement Learning☆592Updated 6 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆614Updated last year
- Implementation of TRPO and related algorithms☆620Updated 6 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆655Updated 4 years ago
- An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.☆326Updated 6 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,419Updated last year
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆206Updated 7 years ago
- A starter agent that can solve a number of universe environments.☆1,101Updated 6 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆264Updated 6 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆360Updated 4 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆416Updated 5 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆986Updated 5 years ago
- Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"☆1,013Updated 6 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆306Updated 2 years ago
- Implementations of Reinforcement Learning Models in Tensorflow☆487Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆211Updated 6 years ago
- Efficient Batched Reinforcement Learning in TensorFlow☆968Updated 5 years ago
- ☆305Updated last year
- Collection of Deep Reinforcement Learning algorithms☆297Updated 5 years ago
- Soft Actor-Critic☆1,001Updated 11 months ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 6 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning☆436Updated 5 years ago