khpeek/Q-learning-Hanoi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/khpeek/Q-learning-Hanoi)

khpeek / Q-learning-Hanoi

Solves the Tower of Hanoi puzzle by Q-learning

☆27

Alternatives and similar repositories for Q-learning-Hanoi

Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

avelazquez15 / DPM
View on GitHub
Dynamic Power Management using Reinforcement Learning for IoT devices.
☆11Oct 23, 2021Updated 4 years ago
sinairv / Temporal-Difference-Learning
View on GitHub
Temporal Difference Learning and Basic Reinforcement Learning Demos in Matlab
☆16Jul 27, 2016Updated 9 years ago
wangshusen / PyRLA
View on GitHub
Randomized Linear Algebra in Python
☆13Mar 21, 2017Updated 9 years ago
smonsays / metax
View on GitHub
flexible meta-learning in jax
☆16Oct 19, 2023Updated 2 years ago
golems / motion-grammar-kit
View on GitHub
Formal Language Tools for Robots
☆15Jun 29, 2016Updated 10 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
VArdulov / ToMNet
View on GitHub
Reimplementation of ToMNet with some extensions for RL as well
☆14Apr 28, 2018Updated 8 years ago
Jpub / AlphaZero
View on GitHub
<알파제로를 분석하며 배우는 인공지능> 리포지토리
☆14Feb 18, 2020Updated 6 years ago
drasmuss / nhrlmodel
View on GitHub
Neural model of hierarchical reinforcement learning
☆16Sep 14, 2017Updated 8 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 2 months ago
FMZennaro / CausalInference
View on GitHub
Illustration of counterfactual inference following Ferenc Huszar example
☆13Aug 15, 2025Updated 10 months ago
cvjena / activeLearning-GP
View on GitHub
This repo contains active learning query strategies as introduced in our GCPR 2013 paper.
☆12Aug 12, 2013Updated 12 years ago
eaertbel / expressiongraph
View on GitHub
This library provides expression trees for representation of geometric expressions and automatic differentiation of these expressions. Th…
☆14Jun 17, 2026Updated 3 weeks ago
mavischer / DRRL
View on GitHub
A2C training of Relational Deep Reinforcement Learning Architecture
☆13Jun 22, 2022Updated 4 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
srsohn / shortest-path-rl
View on GitHub
A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"
☆13Jul 19, 2021Updated 4 years ago
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 10 months ago
ursk / sparco
View on GitHub
Convolutional Sparse Coding
☆10Jul 18, 2014Updated 11 years ago
sygi / vic-tensorflow
View on GitHub
Implementation of Variational Intrinsic Control in tensorflow
☆11Apr 5, 2017Updated 9 years ago
PacktPublishing / Hands-On-Q-Learning-with-Python
View on GitHub
Hands-On Q-Learning with Python, published by Packt
☆29Jan 30, 2023Updated 3 years ago
justinpinkney / data-efficient-gans
View on GitHub
Differentiable Augmentation for Data-Efficient GAN Training
☆11Aug 9, 2020Updated 5 years ago
RobertTLange / gym-hanoi
View on GitHub
A Towers of Hanoi environment in OpenAI Gym Style
☆14Jun 6, 2019Updated 7 years ago
holarissun / embedding-based-llm-alignment
View on GitHub
Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆22Apr 24, 2025Updated last year
rr-learning / trifinger_rl_datasets
View on GitHub
A python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th…
☆17Jan 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jordan-g / PyTorch-Feedback-Alignment-Layers
View on GitHub
PyTorch implementation of linear and convolutional layers with fixed, random feedback weights.
☆15Mar 14, 2021Updated 5 years ago
jonasrothfuss / DeepEpisodicMemory
View on GitHub
Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and…
☆15Sep 12, 2018Updated 7 years ago
chrysatbr / EEG-Emotion-Recognition
View on GitHub
emotion recognition through eeg by using HOS method
☆10Dec 29, 2021Updated 4 years ago
gkahn13 / gcg-old
View on GitHub
a library for deep reinforcement learning, with applications for navigation
☆16Feb 6, 2018Updated 8 years ago
zswang666 / pointnet.tensorflow
View on GitHub
just a neater version of PointNet and PointNet++ in tensorflow
☆13May 3, 2018Updated 8 years ago
NightmareAI / k-diffusion
View on GitHub
Karras et al. (2022) diffusion models for PyTorch
☆11Aug 23, 2022Updated 3 years ago
numenta / htmresearch-core
View on GitHub
Numenta's experimental C++ research code. Please see htmresearch for more details.
☆27Jul 26, 2019Updated 6 years ago
WenRichard / CNN-in-Answer-selection
View on GitHub
WikiQA，复现论文《APPLYING DEEP LEARNING TO ANSWER SELECTION: A STUDY AND AN OPEN TASK》
☆29Jul 25, 2019Updated 6 years ago
IBM / RuDaS
View on GitHub
RuDaS: Synthetic Datasets for Rule Learning
☆20Jun 21, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
camroach87 / gefcom2017data
View on GitHub
Data for eight zones in New England as used in the 2017 Global Energy Forecasting Competition (GEFCom2017).
☆13Mar 7, 2020Updated 6 years ago
dnddnjs / mujoco-pg
View on GitHub
PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment
☆15Jul 1, 2018Updated 8 years ago
reinforcement-learning-kr / reinforcement-learning-pytorch
View on GitHub
Minimal and Clean Reinforcement Learning Examples in PyTorch
☆42Dec 25, 2018Updated 7 years ago
team-hdnet / hdnet
View on GitHub
hdnet - Hopfield denoising network
☆14Oct 6, 2022Updated 3 years ago
Tony-sama / pylfit
View on GitHub
Python implementation of the main algorithms of the Learning From Interpretation Transitions (LFIT) framework
☆17Apr 28, 2026Updated 2 months ago
facebookresearch / natural_rl_environment
View on GitHub
Natural Environment Benchmarks for Reinforcement Learning
☆23May 9, 2019Updated 7 years ago
rationalmatter / juno-demo-notebooks
View on GitHub
Sample notebooks for Juno
☆11Mar 1, 2025Updated last year