reinforcement-learning-kr / rl-montezuma
The state-of-art deep rl algorithms for Montezuma's revenge
☆25Updated 6 years ago
Alternatives and similar repositories for rl-montezuma:
Users that are interested in rl-montezuma are comparing it to the libraries listed below
- ☆49Updated 5 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Updated last month
- ☆83Updated 3 years ago
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆73Updated 5 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆53Updated 6 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Distributed Priortized Experience Replay☆10Updated 6 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆94Updated 2 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- ☆42Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- ☆69Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- Soft Actor-Critic☆144Updated 7 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 5 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆85Updated 5 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Updated 7 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago