khpeek / Q-learning-Hanoi
Solves the Tower of Hanoi puzzle by Q-learning
☆25Updated 7 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below
Sorting:
- Continual Reinforcement Learning in 3D Non-stationary Environments☆37Updated 5 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- ☆28Updated 2 years ago
- ☆31Updated 6 years ago
- Generalised UDRL☆37Updated 3 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 5 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- krazy grid world☆25Updated 5 years ago
- Episodic Control☆20Updated 2 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- The Machine Learning Toybox for testing the behavior of autonomous agents.☆27Updated 3 years ago
- Tutorials on learning and using successor representations.☆52Updated 5 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆55Updated 2 years ago
- Revisiting Rainbow☆74Updated 3 years ago
- Deep Successor Representation☆17Updated 7 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆38Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- TorchingUp provides minimal implementations of common Reinforcement Learning algorithms written in PyTorch. It is meant to complement Ope…☆47Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- ☆22Updated last month
- Neural model of hierarchical reinforcement learning☆16Updated 7 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Updated 13 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- ☆20Updated 5 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Updated 5 years ago