DBaudry / Sub-Sampling-Dueling-Algorithms-Neurips20
☆9 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for Sub-Sampling-Dueling-Algorithms-Neurips20
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017) · ☆21 · Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments · ☆44 · Updated 4 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning. · ☆24 · Updated 5 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations · ☆48 · Updated 2 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. · ☆9 · Updated 5 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. · ☆54 · Updated 5 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" · ☆44 · Updated last year
- ☆14 · Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees · ☆93 · Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping · ☆25 · Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic · ☆47 · Updated 3 years ago
- ☆97 · Updated last year
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019) · ☆33 · Updated 4 years ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t… · ☆44 · Updated 4 years ago
- ☆81 · Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation · ☆35 · Updated 2 weeks ago
- ☆9 · Updated 4 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019 · ☆78 · Updated 5 years ago
- ☆44 · Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". · ☆61 · Updated last year
- A3C style Option-Critic with deliberation cost · ☆39 · Updated 6 years ago
- Logarithmic Reinforcement Learning · ☆26 · Updated last year
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) · ☆68 · Updated last year
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture · ☆18 · Updated 5 years ago
- ☆53 · Updated 6 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables · ☆69 · Updated last year
- ☆42 · Updated 7 years ago
- ☆35 · Updated 6 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1 · ☆15 · Updated 4 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 · ☆25 · Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. · ☆50 · Updated 5 years ago