wshuail / reinforcement_learning_with_TensorflowLinks
Minimal implementations of reinforcement learning algorithms by Tensorflow
☆29Updated 8 years ago
Alternatives and similar repositories for reinforcement_learning_with_Tensorflow
Users that are interested in reinforcement_learning_with_Tensorflow are comparing it to the libraries listed below
Sorting:
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆152Updated 2 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆79Updated 8 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 7 years ago
- Reinforcement Learning in Python☆108Updated 5 years ago
- reimplementation of the ddpg algorithm using tensorflow☆38Updated 9 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 7 years ago
- Deep RL Algorithms implemented for UC Berkeley's CS 294-112: Deep Reinforcement Learning☆141Updated 8 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Updated 5 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆56Updated 6 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Updated 6 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- Duel_DDQN (Dueling Network Architectures + Double DQN) using Keras☆31Updated 9 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Updated 7 years ago
- Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874☆47Updated 4 years ago
- A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env☆70Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Updated 8 years ago
- Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.☆66Updated 7 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Updated 6 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Updated 7 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆193Updated last year
- Reinforcement learning benchmarking.☆39Updated 7 years ago
- Project 3 of Udacity's Deep Reinforcement Learning nanodegree program.☆45Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆80Updated 6 years ago
- This is the code for the "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on Youtube☆70Updated 8 years ago
- Collaborative Deep Reinforcement Learning☆31Updated 8 years ago
- ☆53Updated 8 years ago
- A simple reinforcement learning simulation engine for OpenAI's gym.☆39Updated 7 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Updated 8 years ago
- implement of prioritized experience replay☆159Updated 7 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games☆112Updated 7 years ago