MatheusMRFM / A3C-LSTM-with-TensorflowLinks

An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.

☆29

Alternatives and similar repositories for A3C-LSTM-with-Tensorflow

Users that are interested in A3C-LSTM-with-Tensorflow are comparing it to the libraries listed below

Sorting:

liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48Updated 6 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50Updated 5 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆177Updated 7 years ago
mrkulk / hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87Updated 7 years ago
pkumusic / E-DRL
Exploration Strategies for Deep Reinforcement Learning
☆39Updated 6 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
davidhershey / feudal_networks
An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161
☆183Updated 7 years ago
tesslerc / H-DRLN
Hierarchical Deep RL Network
☆31Updated 8 years ago
NeuroCSUT / DeepMind-Atari-Deep-Q-Learner-2Player
Multiagent Cooperation and Competition with Deep Reinforcement Learning
☆124Updated 9 years ago
andreimuntean / A3C
Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.
☆66Updated 7 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆186Updated 7 years ago
Damcy / prioritized-experience-replay
implement of prioritized experience replay
☆159Updated 6 years ago
florensacc / snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
☆96Updated 7 years ago
spiglerg / DQN_DDQN_Dueling_and_DDPG_Tensorflow
Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…
☆77Updated 8 years ago
wojzaremba / trpo
☆101Updated 8 years ago
steveKapturowski / tensorflow-rl
Implementations of deep RL papers and random experimentation
☆176Updated 7 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278Updated 5 years ago
wulfebw / hierarchical_rl
hierarchical deep reinforcement learning algorithms
☆41Updated 7 years ago
stevenpjg / ddpg-aigym
Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…
☆274Updated 7 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
andrewliao11 / gail-tf
Tensorflow implementation of generative adversarial imitation learning
☆199Updated 7 years ago
openai / baselines-results
☆117Updated 4 years ago
jcwleo / mario_rl
☆69Updated 6 years ago
Alfo5123 / Robust-Multitask-RL
Machine Learning Course Project Skoltech 2018
☆108Updated 6 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125Updated 5 years ago
Maluuba / hra
Hybrid Reward Architecture
☆77Updated 7 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132Updated 6 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago