RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆13Updated 5 years ago
Alternatives and similar repositories for gym-hanoi:
Users that are interested in gym-hanoi are comparing it to the libraries listed below
- Baselines for gymnax 🤖☆63Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Reinforcement Learning with Latent Flow☆43Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- ☆22Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 3 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆102Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- ☆85Updated 6 months ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 3 years ago
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 4 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- ☆84Updated 3 years ago
- ☆29Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Updated 4 years ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆30Updated 4 years ago
- Benchmark data for d3rlpy☆20Updated last year
- Generalised UDRL☆37Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Tutorials on learning and using successor representations.☆50Updated 5 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago