supratikp / HOOF
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Updated 5 years ago
Alternatives and similar repositories for HOOF
Users that are interested in HOOF are comparing it to the libraries listed below
Sorting:
- Code for experimenting with state and action abstractions in reinforcement learning.☆31Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Hierarchical Self-Play☆21Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆46Updated last year
- ☆31Updated 6 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- ☆35Updated 6 years ago
- Disagreement-Regularized Imitation Learning☆30Updated 3 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Updated 2 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Generalised UDRL☆37Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆38Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- ☆54Updated 7 years ago
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- ☆55Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14Updated 7 years ago