RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆13Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for gym-hanoi
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆42Updated last year
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 3 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- Baselines for gymnax 🤖☆60Updated last year
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- krazy grid world☆25Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆66Updated 8 months ago
- Sandbox environment for generalizable agent research☆23Updated 2 years ago
- Tutorials on learning and using successor representations.☆50Updated 5 years ago
- Variational Reinforcement Learning☆16Updated 4 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆36Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- TorchingUp provides minimal implementations of common Reinforcement Learning algorithms written in PyTorch. It is meant to complement Ope…☆42Updated last year
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- ☆85Updated 3 months ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago