Solves the Tower of Hanoi puzzle by Q-learning
☆27Nov 8, 2017Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Oct 29, 2018Updated 7 years ago
- Dynamic Power Management using Reinforcement Learning for IoT devices.☆11Oct 23, 2021Updated 4 years ago
- Randomized Linear Algebra in Python☆13Mar 21, 2017Updated 9 years ago
- flexible meta-learning in jax☆16Oct 19, 2023Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Formal Language Tools for Robots☆15Jun 29, 2016Updated 9 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 8 years ago
- Repository for UMD CS Course: Introduction to Data Science I: Preparing, Storing, and Manipulating Data☆17Dec 13, 2014Updated 11 years ago
- Neural model of hierarchical reinforcement learning☆16Sep 14, 2017Updated 8 years ago
- Q-Learning applied to the classic Travelling Salesman Problem☆19Apr 6, 2017Updated 9 years ago
- ☆18Apr 17, 2026Updated 2 months ago
- Illustration of counterfactual inference following Ferenc Huszar example☆13Aug 15, 2025Updated 10 months ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- Convolutional Sparse Coding☆10Jul 18, 2014Updated 11 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Hands-On Q-Learning with Python, published by Packt☆29Jan 30, 2023Updated 3 years ago
- Learn to build neural networks from scratch, simply. No autograd, no deep learning libraries - just numpy.☆10Aug 10, 2022Updated 3 years ago
- PyMC3 implementation of Drew Linzer’s dynamic Bayesian election forecasting model☆12Nov 4, 2016Updated 9 years ago
- My PhD thesis (in progress!)☆15Oct 23, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆21Oct 19, 2025Updated 7 months ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 7 years ago
- A metaheuristic algorithm framework for solving discrete optimization problems☆20Jul 13, 2013Updated 12 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated 2 years ago
- PyTorch implementation of linear and convolutional layers with fixed, random feedback weights.☆15Mar 14, 2021Updated 5 years ago
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…☆15Sep 12, 2018Updated 7 years ago
- A convolutional auto-encoder for compressing time sequence data of stocks.☆12Oct 9, 2017Updated 8 years ago
- a library for deep reinforcement learning, with applications for navigation☆16Feb 6, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- just a neater version of PointNet and PointNet++ in tensorflow☆13May 3, 2018Updated 8 years ago
- Karras et al. (2022) diffusion models for PyTorch☆11Aug 23, 2022Updated 3 years ago
- Numenta's experimental C++ research code. Please see htmresearch for more details.☆27Jul 26, 2019Updated 6 years ago
- ☆22Sep 19, 2023Updated 2 years ago
- Code for recreating the figures in the brainrender paper (Claudi et al. 2020)☆12Dec 11, 2020Updated 5 years ago
- A simple, one-file Python RPC System that is based on Streams allowing for cross-language/SSH usage.☆16Jun 18, 2013Updated 13 years ago
- Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch☆16Dec 11, 2020Updated 5 years ago