HankyuJang / Non-Stationary-Reinforcement-Learning-
☆8 · Updated 7 years ago
Alternatives and similar repositories for Non-Stationary-Reinforcement-Learning-
Users who are interested in Non-Stationary-Reinforcement-Learning- are comparing it to the repositories listed below.
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 · Updated 3 years ago
- Reinforcement learning on gridworld with Q-learning ☆10 · Updated 8 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Updated 6 years ago
- ☆43 · Updated 8 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 · Updated 7 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019) ☆43 · Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration" ☆12 · Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆56 · Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 · Updated 5 years ago
- ☆83 · Updated 4 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 · Updated 7 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 · Updated 8 years ago
- This repository contains the code used in the paper "Evaluating the Performance of Reinforcement Learning Algorithms" ☆27 · Updated 3 years ago
- Non stationary bandit for experiments with Reinforcement Learning ☆34 · Updated 8 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆67 · Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆95 · Updated 2 years ago
- Simple tools for statistical analyses in RL experiments ☆66 · Updated 7 years ago
- ☆35 · Updated 6 years ago
- A3C style Option-Critic with deliberation cost ☆39 · Updated 7 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 · Updated 5 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆61 · Updated 2 years ago
- ☆27 · Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 · Updated 6 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow ☆58 · Updated 6 years ago
- ☆86 · Updated 10 months ago
- Proximal Policy Optimization with Stein Control Variates ☆33 · Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 · Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit. ☆32 · Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments ☆51 · Updated 9 months ago