HankyuJang / Non-Stationary-Reinforcement-Learning-Links

☆8

Alternatives and similar repositories for Non-Stationary-Reinforcement-Learning-

Users that are interested in Non-Stationary-Reinforcement-Learning- are comparing it to the libraries listed below

Sorting:

ludobouan / Q-learning-gridworld
Reinforcement learning on gridworld with Q-learning
☆10Updated 8 years ago
mcmachado / options
☆43Updated 8 years ago
lcalem / reproduction-soft-qlearning-mutual-information
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Updated 6 years ago
google-research / deep_ope
☆86Updated 11 months ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
aa14k / Exploration-in-RL
☆28Updated last year
social-dilemma / multiagent
Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas
☆49Updated 2 years ago
Officium / RL-Experiments
High-quality implementations of deep reinforcement learning algorithms for experiments
☆51Updated 10 months ago
siemens / industrialbenchmark
Industrial Benchmark
☆130Updated 2 years ago
causal-rl-anonymous / causal-rl
☆44Updated 3 years ago
yfletberliac / rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
☆88Updated 5 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87Updated 6 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 7 years ago
yaoliucs / PQL
Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"
☆12Updated 4 years ago
JKCooper2 / gym-bandits
Bandits Environments for the OpenAI Gym
☆89Updated 5 years ago
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆36Updated 4 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 2 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
hiwonjoon / ICML2019-TREX
☆84Updated 4 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75Updated 4 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33Updated 5 years ago
nathangrinsztajn / Box-World
Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"
☆46Updated last year
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67Updated 5 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Updated 2 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆94Updated 3 weeks ago