RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆13 Updated 6 years ago
Alternatives and similar repositories for gym-hanoi
Users that are interested in gym-hanoi are comparing it to the libraries listed below
- A Tutorial on Deep Reinforcement Learning in PyTorch ☆33 Updated 2 years ago
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗 ☆27 Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning ☆55 Updated 5 years ago
- Baselines for gymnax 🤖 ☆71 Updated 2 years ago
- Some small scale experiments for my blog posts 📝 ☆79 Updated 3 years ago
- ☆125 Updated 2 years ago
- ☆87 Updated last year
- Neuro-evolution for OpenAI Gym environments ☆57 Updated 4 years ago
- Train self-modifying neural networks with neuromodulated plasticity ☆78 Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆64 Updated 2 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆36 Updated 4 years ago
- An easy-to-use reinforcement learning library for research and education. ☆171 Updated last week
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆45 Updated 2 years ago
- Contains all materials for the paper "A counterfactual simulation model of causal judgment". ☆24 Updated 4 years ago
- Python implementation of Bayesian Program Learning tools (with PyTorch) ☆74 Updated 3 years ago
- A Python Toolkit for Managing a Large Number of Experiments ☆32 Updated last year
- Tutorials on learning and using successor representations. ☆52 Updated 5 years ago
- Unified notation for Markov Decision Processes PO(MDP)s ☆24 Updated 7 years ago
- Code for "Recurrent Independent Mechanisms" ☆118 Updated 3 years ago
- ☆55 Updated 10 months ago
- Reinforcement learning library in JAX. ☆100 Updated last year
- ☆84 Updated 4 years ago
- Library to compare and evaluate reward functions ☆67 Updated last year
- ☆54 Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago
- ☆31 Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI. ☆79 Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 4 years ago
- krazy grid world ☆25 Updated 5 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" ☆47 Updated last year