BY571 / Upside-Down-Reinforcement-LearningLinks
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆77Updated 5 years ago
Alternatives and similar repositories for Upside-Down-Reinforcement-Learning
Users that are interested in Upside-Down-Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆97Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- ☆80Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- A framework for easy prototyping of distributed reinforcement learning algorithms☆96Updated 4 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Updated 4 years ago
- ☆71Updated 2 years ago
- PyTorch code to train and evaluate Procgen tasks☆25Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- ☆35Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Let's solve the flatland challenge!☆73Updated last year
- Highly Modular and Scalable Reinforcement Learning☆117Updated 5 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 6 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 5 years ago
- Revisiting Rainbow☆75Updated 4 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Updated 4 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- Reinforcement learning algorithms in RLlib☆59Updated last year
- ☆87Updated 4 years ago
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd☆91Updated 2 years ago
- MultiTask Environments for Reinforcement Learning.☆77Updated 3 years ago
- Generalised UDRL☆37Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆89Updated 4 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆152Updated 3 years ago