h2r / burlap_caffe
☆53Updated 8 years ago
Alternatives and similar repositories for burlap_caffe:
Users that are interested in burlap_caffe are comparing it to the libraries listed below
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 8 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow.☆120Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- Implementations of deep RL papers and random experimentation☆176Updated 6 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Updated 8 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 8 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 7 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Updated 6 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Updated 8 years ago
- Collection of Deep Reinforcement Learning algorithms☆124Updated 7 years ago
- Value Iteration Networks☆289Updated 7 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- Using Asynchronous Deep Reinforcement Learning to play Flappy Bird from pixel input.☆30Updated 7 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆53Updated 5 years ago
- ☆159Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆84Updated 9 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆103Updated 5 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- ☆79Updated 6 years ago
- implement of prioritized experience replay☆160Updated 6 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆79Updated 6 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆212Updated 7 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆124Updated 9 years ago
- Noisy Networks for Exploration☆186Updated 7 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 8 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆77Updated 8 years ago