csiro-mlai / RL-BOEDLinks
☆14Updated 3 years ago
Alternatives and similar repositories for RL-BOED
Users that are interested in RL-BOED are comparing it to the libraries listed below
Sorting:
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆43Updated last week
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆45Updated 4 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…☆55Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆91Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆105Updated 4 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Updated 5 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- ☆88Updated last year
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- ☆51Updated 3 years ago
- ☆32Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆196Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆161Updated 5 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆189Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 3 years ago
- ☆31Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆95Updated 3 months ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆128Updated 4 years ago
- Our version of #Exploration: A Study of Count-Based Explorationfor Deep Reinforcement Learning for a class project☆16Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 3 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- ☆26Updated 2 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 6 months ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year