bprabhakar / upside-down-reinforcement-learning
Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.
☆11Updated 4 years ago
Alternatives and similar repositories for upside-down-reinforcement-learning:
Users that are interested in upside-down-reinforcement-learning are comparing it to the libraries listed below
- Generalised UDRL☆37Updated 2 years ago
- A framework for implementing equivariant DL☆10Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 5 months ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆13Updated 6 years ago
- Clockwork VAEs in JAX/Flax☆32Updated 3 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 3 years ago
- NeurIPS 2019 Paper Implementation☆12Updated 2 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies"☆11Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆18Updated 11 months ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Deep reinforcement learning for adaptation in evolutionary algorithms☆9Updated 5 years ago
- ☆18Updated 2 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆23Updated 4 years ago
- Understanding RL vision Distill article☆23Updated last year
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆11Updated last year
- ☆17Updated 2 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- ☆45Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- ☆22Updated 3 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 3 years ago
- JAX implementation of Graph Attention Networks☆13Updated 2 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago