bprabhakar / upside-down-reinforcement-learning
Pytorch based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for upside-down-reinforcement-learning
- A framework for implementing equivariant DL☆10Updated 3 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆23Updated 3 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Variational Reinforcement Learning☆16Updated 3 months ago
- Clockwork VAEs in JAX/Flax☆31Updated 3 years ago
- ☆17Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆25Updated 2 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies"☆11Updated 2 years ago
- ☆22Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 2 years ago
- NeurIPS 2019 Paper Implementation☆13Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆13Updated 5 years ago
- ☆29Updated 2 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 2 years ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆11Updated last year
- TaskMet Task-driven Metric Learning for Model Learning☆18Updated 9 months ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- JAX implementation of Graph Attention Networks☆13Updated 2 years ago
- ☆20Updated 5 years ago
- Code for the benchmark containing dataset, models and metrics for productive concept learning -- a kind of compositional reasoning task t…☆16Updated 3 years ago
- ☆16Updated 3 years ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago