instadeepai / AlphaNPI

Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.

☆79

Alternatives and similar repositories for AlphaNPI

Users who are interested in AlphaNPI are comparing it to the libraries listed below.

BY571 / Upside-Down-Reinforcement-Learning
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆77 Updated 4 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
facebookresearch / rela
Reinforcement Learning Assembly
☆92 Updated 3 years ago
flowersteam / geppg
☆35 Updated 6 years ago
mfranzs / meta-learning-curiosity-algorithms
☆80 Updated last year
alexis-jacq / LOLA_DiCE
PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96 Updated 6 years ago
pathak22 / modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
☆116 Updated 5 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93 Updated 5 years ago
ericjang / maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in JAX
☆191 Updated 2 years ago
pathak22 / exploration-by-disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
☆125 Updated 6 years ago
google-deepmind / dm_hard_eight
☆84 Updated 4 years ago
uber-research / backpropamine
Train self-modifying neural networks with neuromodulated plasticity
☆77 Updated 5 years ago
Feryal / craft-env
☆44 Updated 6 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30 Updated 4 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87 Updated 6 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44 Updated 6 years ago
hardmaru / astool
Augmented environments with RL
☆104 Updated 6 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆96 Updated 4 years ago
rddy / ReQueST
Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"
☆84 Updated 5 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 Updated 6 years ago
facebookresearch / dcem
The Differentiable Cross-Entropy Method
☆123 Updated 4 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52 Updated 4 years ago
Feryal / automated-curriculum-rl
☆31 Updated 6 years ago
wassname / world-models-sonic-pytorch
Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…
☆32 Updated 6 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆102 Updated 2 years ago
geyang / plan2vec
Public Release of Plan2vec Implementation in PyTorch
☆57 Updated 2 years ago
koulanurag / mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆51 Updated 2 years ago
tkipf / compile
CompILE: Compositional Imitation Learning and Execution (ICML 2019)
☆112 Updated 6 years ago
iosband / TabulaRL
☆65 Updated last year
bstadie / krazyworld
krazy grid world
☆25 Updated 5 years ago