instadeepai / AlphaNPILinks
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
Alternatives and similar repositories for AlphaNPI
Users that are interested in AlphaNPI are comparing it to the libraries listed below
Sorting:
- ☆80Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆95Updated 6 years ago
- ☆44Updated 6 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 6 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆50Updated 2 years ago
- ☆35Updated 6 years ago
- ☆84Updated 4 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆125Updated 6 years ago
- Some hard problems for reinforcement learning.☆31Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆95Updated 4 years ago
- Augmented environments with RL☆104Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 5 years ago
- An environment for benchmarking commonsense agents☆29Updated 4 years ago
- A job launching library for docker, EC2, GCP, etc.☆57Updated 3 years ago