Solves the Tower of Hanoi puzzle by Q-learning
☆27Nov 8, 2017Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below.
- Temporal Difference Learning and Basic Reinforcement Learning Demos in Matlab☆16Jul 27, 2016Updated 9 years ago
- Randomized Linear Algebra in Python☆13Mar 21, 2017Updated 9 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 8 years ago
- Neural model of hierarchical reinforcement learning☆16Sep 14, 2017Updated 8 years ago
- Q-Learning applied to the classic Travelling Salesman Problem☆19Apr 6, 2017Updated 9 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- PyData San Luis 2017 Tutorial: An Introduction to Gaussian Processes in PyMC3☆15Nov 16, 2017Updated 8 years ago
- Convolutional Sparse Coding☆10Jul 18, 2014Updated 11 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Learn to build neural networks from scratch, simply. No autograd, no deep learning libraries - just numpy.☆10Aug 10, 2022Updated 3 years ago
- PyMC3 implementation of Drew Linzer’s dynamic Bayesian election forecasting model☆12Nov 4, 2016Updated 9 years ago
- ☆11Nov 28, 2022Updated 3 years ago
- ☆10Dec 12, 2017Updated 8 years ago
- My PhD thesis (in progress!)☆15Oct 23, 2016Updated 9 years ago
- Differentiable Augmentation for Data-Efficient GAN Training☆11Aug 9, 2020Updated 5 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆21Oct 19, 2025Updated 7 months ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- An example repository demonstrating Bazel cc_binary and cc_library build targets.☆11Mar 3, 2016Updated 10 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆15Sep 12, 2018Updated 7 years ago
- A convolutional auto-encoder for compressing time sequence data of stocks.☆12Oct 9, 2017Updated 8 years ago
- Emotion recognition from EEG using the HOS method☆10Dec 29, 2021Updated 4 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- just a neater version of PointNet and PointNet++ in tensorflow☆13May 3, 2018Updated 8 years ago
- RuDaS: Synthetic Datasets for Rule Learning☆20Jun 21, 2022Updated 3 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆15Jul 1, 2018Updated 7 years ago
- Code for recreating the figures in the brainrender paper (Claudi et al. 2020)☆12Dec 11, 2020Updated 5 years ago
- Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch☆16Dec 11, 2020Updated 5 years ago
- Python implementation of the main algorithms of the Learning From Interpretation Transitions (LFIT) framework☆17Apr 28, 2026Updated last month
- Natural Environment Benchmarks for Reinforcement Learning☆23May 9, 2019Updated 7 years ago
- Sample notebooks for Juno☆11Mar 1, 2025Updated last year
- KDD Cup of Fresh Air 2018☆19Apr 12, 2018Updated 8 years ago
- Cat-and-Mouse game with Reinforcement Learning (Q-Learning).☆37Jun 21, 2020Updated 5 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆22Jan 8, 2018Updated 8 years ago
- An approach that can effectively detect similar repositories on GitHub.☆45Oct 23, 2016Updated 9 years ago