instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces when training NPI.
☆79 Updated last year
Alternatives and similar repositories for AlphaNPI
Users who are interested in AlphaNPI are comparing it to the libraries listed below.
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL ☆87 Updated 6 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 Updated 6 years ago
- ☆44 Updated 6 years ago
- ☆85 Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- krazy grid world ☆25 Updated 5 years ago
- Convert DeepMind Control Suite to OpenAI gym environments. ☆86 Updated 5 years ago
- A job launching library for docker, EC2, GCP, etc. ☆57 Updated 3 years ago
- Code accompanying the OptionGAN paper. ☆44 Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆95 Updated 4 years ago
- Code for experimenting with state and action abstractions in reinforcement learning. ☆30 Updated 4 years ago
- ☆35 Updated 6 years ago
- Options of Interest: Temporal Abstraction with Interest Functions (AAAI 2020) ☆25 Updated 4 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity" ☆116 Updated 5 years ago
- Simple tools for statistical analyses in RL experiments ☆66 Updated 7 years ago
- Reward Learning by Simulating the Past ☆44 Updated 6 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" ☆46 Updated last year
- MultiTask Environments for Reinforcement Learning. ☆76 Updated 2 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement ☆124 Updated 6 years ago
- Invariant Causal Prediction for Block MDPs ☆44 Updated 5 years ago
- ☆43 Updated 8 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 Updated 2 years ago
- ☆65 Updated last year
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading". ☆38 Updated 3 years ago
- impact-driven-exploration ☆131 Updated last year
- Models built with TensorFlow ☆25 Updated 6 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 Updated 5 years ago