instadeepai / AlphaNPILinks
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
Alternatives and similar repositories for AlphaNPI
Users that are interested in AlphaNPI are comparing it to the libraries listed below
Sorting:
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- Reinforcement Learning Assembly☆92Updated 3 years ago
- ☆35Updated 6 years ago
- ☆80Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 6 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 2 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆125Updated 6 years ago
- ☆84Updated 4 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 5 years ago
- ☆44Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Augmented environments with RL☆104Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆96Updated 4 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- The Differentiable Cross-Entropy Method☆123Updated 4 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 4 years ago
- ☆31Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 2 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆51Updated 2 years ago
- CompILE: Compositional Imitation Learning and Execution (ICML 2019)☆112Updated 6 years ago
- ☆65Updated last year
- krazy grid world☆25Updated 5 years ago