Solves the Tower of Hanoi puzzle by Q-learning
☆28 · Nov 8, 2017 · Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below.
- Dynamic Power Management using Reinforcement Learning for IoT devices. ☆11 · Oct 23, 2021 · Updated 4 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023) ☆10 · Dec 12, 2023 · Updated 2 years ago
- flexible meta-learning in jax ☆16 · Oct 19, 2023 · Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model ☆13 · Mar 27, 2024 · Updated 2 years ago
- Repository for <Learning Artificial Intelligence by Analyzing AlphaZero> ☆14 · Feb 18, 2020 · Updated 6 years ago
- Q-Learning applied to the classic Travelling Salesman Problem ☆19 · Apr 6, 2017 · Updated 8 years ago
- ☆18 · Mar 18, 2026 · Updated last week
- Illustration of counterfactual inference following Ferenc Huszar example ☆13 · Aug 15, 2025 · Updated 7 months ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper. ☆12 · Aug 12, 2013 · Updated 12 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture ☆13 · Jun 22, 2022 · Updated 3 years ago
- This library provides expression trees for representation of geometric expressions and automatic differentiation of these expressions. Th… ☆14 · Aug 24, 2023 · Updated 2 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics ☆15 · Jan 7, 2020 · Updated 6 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks" ☆13 · Jul 19, 2021 · Updated 4 years ago
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025) ☆17 · Aug 22, 2025 · Updated 7 months ago
- PyData San Luis 2017 Tutorial: An Introduction to Gaussian Processes in PyMC3 ☆15 · Nov 16, 2017 · Updated 8 years ago
- Convolutional Sparse Coding ☆10 · Jul 18, 2014 · Updated 11 years ago
- Implementation of Variational Intrinsic Control in tensorflow ☆11 · Apr 5, 2017 · Updated 8 years ago
- Official implementation of Neural Episodic Control with State Abstraction ☆13 · Aug 3, 2023 · Updated 2 years ago
- Hands-On Q-Learning with Python, published by Packt ☆29 · Jan 30, 2023 · Updated 3 years ago
- PyMC3 implementation of Drew Linzer’s dynamic Bayesian election forecasting model ☆12 · Nov 4, 2016 · Updated 9 years ago
- ☆11 · Nov 28, 2022 · Updated 3 years ago
- ☆10 · Dec 12, 2017 · Updated 8 years ago
- many powerful tools for studying irreducible representations of SU(n), including making animations of hadron flavor-state multiplets ☆13 · Jul 25, 2021 · Updated 4 years ago
- My PhD thesis (in progress!) ☆14 · Oct 23, 2016 · Updated 9 years ago
- Differentiable Augmentation for Data-Efficient GAN Training ☆11 · Aug 9, 2020 · Updated 5 years ago
- A Towers of Hanoi environment in OpenAI Gym Style ☆14 · Jun 6, 2019 · Updated 6 years ago
- An example repository demonstrating Bazel cc_binary and cc_library build targets. ☆12 · Mar 3, 2016 · Updated 10 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs ☆22 · Apr 24, 2025 · Updated 11 months ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 · Jun 19, 2024 · Updated last year
- A python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th… ☆17 · Jan 17, 2024 · Updated 2 years ago
- PyTorch implementation of linear and convolutional layers with fixed, random feedback weights. ☆15 · Mar 14, 2021 · Updated 5 years ago
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and… ☆15 · Sep 12, 2018 · Updated 7 years ago
- A convolutional auto-encoder for compressing time sequence data of stocks. ☆12 · Oct 9, 2017 · Updated 8 years ago
- Emotion recognition through EEG using the HOS method ☆10 · Dec 29, 2021 · Updated 4 years ago
- CUDA extension for the SPORCO project ☆18 · Jul 5, 2021 · Updated 4 years ago
- Karras et al. (2022) diffusion models for PyTorch ☆12 · Aug 23, 2022 · Updated 3 years ago
- Numenta's experimental C++ research code. Please see htmresearch for more details. ☆27 · Jul 26, 2019 · Updated 6 years ago
- ☆22 · Sep 19, 2023 · Updated 2 years ago
- RuDaS: Synthetic Datasets for Rule Learning ☆19 · Jun 21, 2022 · Updated 3 years ago