bprabhakar / upside-down-reinforcement-learning

PyTorch-based implementation of Upside Down Reinforcement Learning (UDRL) by J. Schmidhuber et al.

☆11

Alternatives and similar repositories for upside-down-reinforcement-learning

Users who are interested in upside-down-reinforcement-learning are comparing it to the repositories listed below.

EleutherAI / equivariance
A framework for implementing equivariant DL
☆10 Updated 4 years ago
jscriptcoder / Upside-Down-Reinforcement-Learning
Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)
☆11 Updated last year
distillpub / post--understanding-rl-vision
Understanding RL vision Distill article
☆23 Updated 2 years ago
kachayev / gym-microrts-paper-sb3
RL agent to play μRTS with Stable-Baselines3 and PyTorch
☆26 Updated 3 years ago
Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
ThomasMiconi / Meta-Task-Generator
Automatically generate simple meta-learning tasks from a very large space
☆15 Updated last year
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25 Updated 4 years ago
geyang / plan2vec
Public Release of Plan2vec Implementation in pyTorch
☆56 Updated 2 years ago
enlite-ai / maze_smaac
Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze
☆10 Updated 3 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16 Updated 11 months ago
IDSIA / GoGePo
Official repository for the paper "Goal-Conditioned Generators of Deep Policies"
☆11 Updated last month
google-deepmind / affordances_option_models
☆23 Updated 3 years ago
locuslab / ase
Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…
☆11 Updated 4 years ago
jan-schuchardt / learning-to-evolve
Deep reinforcement learning for adaptation in evolutionary algorithms
☆9 Updated 5 years ago
Kajiyu / kanerva_machine
The implementation of "The Kanerva Machine" with Pytorch and Pyro
☆12 Updated 7 years ago
IDSIA / recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆49 Updated last month
facebookresearch / neural-scs
Neural Fixed-Point Acceleration for Convex Optimization
☆29 Updated 2 years ago
mbchang / decentralized-rl
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)
☆43 Updated 2 years ago
kyunghyuncho / backprop-kalman-filter
☆45 Updated 5 years ago
cair / PyTsetlinMachineCUDA
Massively Parallel and Asynchronous Architecture for Logic-based AI
☆42 Updated 2 years ago
juliuskunze / cwvae-jax
Clockwork VAEs in JAX/Flax
☆32 Updated 3 years ago
jcoreyes / evolvingrl
Supplementary Data for Evolving Reinforcement Learning Algorithms
☆46 Updated 4 years ago
Abhishaike / HyperProtoNetReproduce
NeurIPS 2019 Paper Implementation
☆12 Updated 2 years ago
vtopt / qnstop
Quasi-Newton Algorithm for Stochastic Optimization
☆10 Updated 3 years ago
attentionagent / attentionagent.github.io
Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)
☆21 Updated 3 years ago
two2tee / WorldModelPlanning
☆16 Updated 4 years ago
davidmatthews1uvm / 2019-IROS
Code for D. Matthews, S. Kriegman, C. Cappelle and J. Bongard, "Word2vec to behavior: morphology facilitates the grounding of language in…
☆15 Updated 5 years ago
ML-KULeuven / klay
Sparse Circuits on the GPU (ICLR2025)
☆12 Updated last month
yangkevin2 / neurips2021-lap3
☆17 Updated 3 years ago
kachayev / dataclasses-tensor
Easily serialize dataclasses to and from tensors (PyTorch, NumPy)
☆18 Updated 4 years ago