joschu / modular_rl
Implementation of TRPO and related algorithms
☆632Updated 6 years ago
Alternatives and similar repositories for modular_rl
Users who are interested in modular_rl are comparing it to the libraries listed below.
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆274Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Updated 4 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆711Updated 6 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆214Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆423Updated last year
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆656Updated 5 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆590Updated 6 years ago
- Implementation of Meta-RL A3C algorithm☆403Updated 8 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆394Updated 6 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆403Updated 8 years ago
- Reimplementation of DDPG (Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + TensorFlow☆561Updated 3 years ago
- Collection of Deep Reinforcement Learning algorithms☆300Updated 6 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆343Updated 6 years ago
- A3C LSTM Atari with Pytorch plus A3G design☆566Updated 2 years ago
- Implementations of Reinforcement Learning Models in Tensorflow☆487Updated 7 years ago
- ☆303Updated 2 years ago
- Using Keras and Deep Deterministic Policy Gradient to play TORCS☆722Updated 7 years ago
- Learning to Communicate with Deep Multi-Agent Reinforcement Learning☆436Updated 6 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017)☆1,109Updated 7 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,269Updated 5 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆309Updated 4 years ago
- Actor-critic with experience replay☆252Updated 2 years ago
- Basic DQN implementation☆225Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆176Updated 7 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆2,966Updated last year
- Code for the paper "Meta-Learning Shared Hierarchies"☆613Updated last year
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆260Updated 7 months ago
- PyTorch implementation of Trust Region Policy Optimization☆441Updated 6 years ago
- Implementation of prioritized experience replay☆159Updated 6 years ago
- Constrained Policy Optimization☆320Updated 7 years ago