RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆13 · Updated 6 years ago
Alternatives and similar repositories for gym-hanoi
Users who are interested in gym-hanoi are comparing it to the libraries listed below.
- A Tutorial on Deep Reinforcement Learning in PyTorch ☆32 · Updated last year
- Baselines for gymnax 🤖 ☆66 · Updated 2 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms ☆22 · Updated 4 years ago
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗 ☆27 · Updated 4 years ago
- ☆53 · Updated 7 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 · Updated last year
- Implicit Normalizing Flows + Reinforcement Learning ☆61 · Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 · Updated 3 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning ☆55 · Updated 5 years ago
- krazy grid world ☆25 · Updated 5 years ago
- Map-Elites based on Evolution Strategies ☆31 · Updated 3 years ago
- Python implementation of Bayesian Program Learning tools (with PyTorch) ☆72 · Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆38 · Updated 5 years ago
- Tutorials on learning and using successor representations. ☆52 · Updated 5 years ago
- ☆122 · Updated last year
- Proximal Policy Optimization with Stein Control Variates ☆33 · Updated 7 years ago
- Reinforcement learning library in JAX. ☆100 · Updated last year
- ☆31 · Updated 6 years ago
- ☆86 · Updated 10 months ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 · Updated 4 years ago
- Generalised UDRL ☆37 · Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 · Updated last year
- Invariant Causal Prediction for Block MDPs ☆44 · Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆49 · Updated 3 years ago
- ☆85 · Updated 4 years ago
- ☆86 · Updated 3 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 · Updated 4 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 · Updated 3 years ago
- ☆14 · Updated 6 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 · Updated 3 years ago