ducandu / RL-Implementation-IMPALALinks

A Test-Implementation of the IMPALA algorithm (by deepmind 2018)

☆35

Alternatives and similar repositories for RL-Implementation-IMPALA

Users that are interested in RL-Implementation-IMPALA are comparing it to the libraries listed below

Sorting:

tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76Updated 2 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆52Updated 7 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67Updated 5 years ago
google-research / episodic-curiosity
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆204Updated 4 years ago
EndingCredits / Neural-Episodic-Control
Implementation of Deepmind's Neural Episodic Control
☆58Updated 7 years ago
islamelnabarawy / sc2gym
PySC2 OpenAI Gym Environments
☆48Updated 6 years ago
wwxFromTju / deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…
☆128Updated 2 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆95Updated 2 years ago
mrkulk / hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87Updated 7 years ago
reinforcement-learning-kr / rl-montezuma
The state-of-art deep rl algorithms for Montezuma's revenge
☆27Updated 6 years ago
flowersteam / geppg
☆35Updated 6 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166Updated 7 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 5 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28Updated 6 years ago
dibyaghosh / dnc
Code for "Divide-and-Conquer Reinforcement Learning"
☆61Updated 6 years ago
cyoon1729 / distributedRL
A framework for easy prototyping of distributed reinforcement learning algorithms
☆95Updated 4 years ago
createamind / DRL
☆92Updated 4 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103Updated 3 months ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125Updated 5 years ago
davidhershey / feudal_networks
An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161
☆183Updated 7 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
☆62Updated 4 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆177Updated 7 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
intel / cerl
☆72Updated 2 years ago
podondra / gym-gridworlds
Gridworld environments for OpenAI gym.
☆80Updated last year