h2r / burlap_caffe
☆53Updated 8 years ago
Alternatives and similar repositories for burlap_caffe:
Users who are interested in burlap_caffe are comparing it to the repositories listed below
- Reimplementation of the DDPG algorithm using TensorFlow☆38Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN).☆56Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org☆53Updated 5 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆192Updated 6 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Updated 7 years ago
- Deep Attention Recurrent Q-Network☆115Updated 9 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- Multiagent Cooperation and Competition with Deep Reinforcement Learning☆124Updated 9 years ago
- Advantage async actor-critic algorithms (A3C) and Progressive Neural Network implemented in TensorFlow.☆120Updated 8 years ago
- PyTorch implementation of Advantage async actor-critic algorithms (A3C)☆113Updated 7 years ago
- ☆78Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆124Updated 7 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 6 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 8 years ago
- Reinforcement learning benchmarking.☆39Updated 6 years ago
- ☆159Updated 7 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆211Updated 6 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆30Updated 7 years ago
- Reinforcement learning toolbox; contains TRPO and A3C algorithms for continuous action spaces☆42Updated 7 years ago
- Implement A3C for Mujoco gym envs☆72Updated 7 years ago
- Value Iteration Networks☆289Updated 7 years ago
- Yet another prioritized experience replay buffer implementation.☆48Updated 2 years ago
- Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain …☆83Updated 8 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Updated 8 years ago
- Hybrid Reward Architecture☆77Updated 6 years ago
- Implementation of prioritized experience replay☆158Updated 6 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 6 years ago
- deep reinforcement learning for personal research☆84Updated 7 years ago