junjungoal / IMPALA-pytorch

PyTorch IMPALA implementation

☆27

Alternatives and similar repositories for IMPALA-pytorch

Users who are interested in IMPALA-pytorch are comparing it to the repositories listed below.

eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28 Updated 4 years ago
dnddnjs / feudal-montezuma
PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96 Updated 3 years ago
rraileanu / idaac
☆54 Updated last year
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆40 Updated 6 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆54 Updated 4 months ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22 Updated 5 years ago
spitis / mrl
☆113 Updated 2 years ago
sfujim / LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
☆36 Updated 3 years ago
ermongroup / MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
☆73 Updated 2 years ago
navneet-nmk / Hierarchical-Meta-Reinforcement-Learning
Implementation of the paper "Exploration via Hierarchical Meta Reinforcement Learning".
☆60 Updated 6 years ago
pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆125 Updated 4 years ago
toshikwa / slac.pytorch
PyTorch implementation of Stochastic Latent Actor-Critic (SLAC).
☆93 Updated last year
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51 Updated 2 months ago
kngwyu / Rainy
Deep RL agents with PyTorch
☆35 Updated 3 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 Updated 2 years ago
younggyoseo / pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
☆46 Updated 6 years ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆65 Updated 5 years ago
RLAgent / state-marginal-matching
Efficient Exploration via State Marginal Matching (2019)
☆69 Updated 6 years ago
robintyh1 / onpolicybaselines
On-policy optimization baselines for deep reinforcement learning
☆30 Updated 5 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆32 Updated 5 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66 Updated 5 years ago
WilsonWangTHU / POPLIN
☆99 Updated 2 years ago
neka-nat / distributed_rl
PyTorch implementation of distributed deep reinforcement learning
☆76 Updated 3 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50 Updated this week
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48 Updated 4 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75 Updated 4 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67 Updated 5 years ago
Farama-Foundation / D4RL-Evaluations
☆199 Updated 2 years ago
wendelinboehmer / dcg
☆75 Updated last year
apexrl / bmpo
Implementation of the ICML 2020 paper "Bidirectional Model-based Policy Optimization"
☆23 Updated 2 years ago