Riashat / Bayesian-Exploration-Deep-RLLinks
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Updated 8 years ago
Alternatives and similar repositories for Bayesian-Exploration-Deep-RL
Users that are interested in Bayesian-Exploration-Deep-RL are comparing it to the libraries listed below
Sorting:
- Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations☆29Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆30Updated 8 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Updated 2 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆48Updated 6 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆67Updated 5 years ago
- Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind☆10Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Exploration by Random Network Distillation☆15Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated last year
- Meta Reinforcement Learning Experiments☆34Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- PyRL - Reinforcement Learning Framework in Pytorch (Policy Gradient, DQN, DDPG, TD3, PPO, SAC, etc.)☆34Updated 3 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 6 years ago
- ☆27Updated 6 years ago
- ☆19Updated 6 years ago
- Hierarchical Self-Play☆21Updated 6 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 6 years ago
- ☆14Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆32Updated 7 years ago