joschu / modular_rl
Implementation of TRPO and related algorithms
☆620 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for modular_rl
- Code for the paper "Generative Adversarial Imitation Learning" ☆690 · Updated 5 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆360 · Updated 4 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning. ☆655 · Updated 4 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir… ☆275 · Updated 6 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆416 · Updated 11 months ago
- Asynchronous Methods for Deep Reinforcement Learning ☆592 · Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms ☆297 · Updated 5 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783) ☆400 · Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆211 · Updated 6 years ago
- Accompanying repository for Let's make a DQN / A3C series. ☆391 · Updated 6 years ago
- ☆305 · Updated last year
- Reimplementation of DDPG (Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + TensorFlow ☆555 · Updated 3 years ago
- Hacks for training RL systems from John Schulman's lecture at Deep RL Bootcamp (Aug 2017) ☆1,099 · Updated 7 years ago
- A3C LSTM Atari with PyTorch plus A3G design ☆562 · Updated last year
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. ☆987 · Updated 5 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆341 · Updated 5 years ago
- Code for the paper "Meta-Learning Shared Hierarchies" ☆614 · Updated last year
- Using Keras and Deep Deterministic Policy Gradient to play TORCS ☆717 · Updated 6 years ago
- Implementations of Reinforcement Learning Models in TensorFlow ☆487 · Updated 7 years ago
- Implementations of deep RL papers and random experimentation ☆177 · Updated 6 years ago
- Implementation of Meta-RL A3C algorithm ☆400 · Updated 7 years ago
- Efficient Batched Reinforcement Learning in TensorFlow ☆968 · Updated 5 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆433 · Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆307 · Updated 3 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". ☆1,230 · Updated 5 years ago
- Lua/Torch implementation of DQN (Nature, 2015) ☆589 · Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow ☆193 · Updated 6 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,222 · Updated 11 months ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning ☆1,419 · Updated last year
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment ☆264 · Updated 6 years ago