khpeek / Q-learning-HanoiLinks
Solves the Tower of Hanoi puzzle by Q-learning
☆27Updated 7 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below
Sorting:
- ☆31Updated 6 years ago
- Fully differentiable RL environments, written in Ivy.☆65Updated 2 years ago
- rlcourse-march-17-hugobb created by GitHub Classroom☆16Updated last year
- Generalised UDRL☆37Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆22Updated 4 years ago
- ☆28Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 6 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings.☆65Updated 2 years ago
- ☆125Updated 2 years ago
- Revisiting Rainbow☆75Updated 4 years ago
- krazy grid world☆25Updated 5 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- ☆31Updated 3 years ago
- Library to compare and evaluate reward functions☆67Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆22Updated 3 years ago
- Vectorization techniques for fast population-based training.☆56Updated 3 years ago
- ☆84Updated 4 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 6 years ago
- TorchingUp provides minimal implementations of common Reinforcement Learning algorithms written in PyTorch. It is meant to complement Ope…☆53Updated 2 years ago
- Collection of reinforcement learning algorithms☆16Updated last month
- Tutorials on learning and using successor representations.☆54Updated 6 years ago
- An implementation of MuZero in JAX.☆57Updated 2 years ago