khpeek / Q-learning-Hanoi
Solves the Tower of Hanoi puzzle by Q-learning
☆27 · Nov 8, 2017 · Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users interested in Q-learning-Hanoi are comparing it to the repositories listed below.
- Unlock level without hassle in Candy Crush Saga ☆22 · Sep 5, 2017 · Updated 8 years ago
- ☆10 · Feb 13, 2025 · Updated last year
- ☆11 · Jun 4, 2023 · Updated 2 years ago
- Sample notebooks for Juno ☆11 · Mar 1, 2025 · Updated 11 months ago
- ☆11 · Sep 8, 2025 · Updated 5 months ago
- enuSpace plugin for Tensorflow (graphical logic block, flow programming) ☆11 · Feb 6, 2020 · Updated 6 years ago
- ☆12 · Jul 11, 2022 · Updated 3 years ago
- Upload a document image or PDF, or provide a URL, to convert it into a structured format using SmolDocling. ☆16 · Mar 31, 2025 · Updated 10 months ago
- ☆10 · Mar 10, 2021 · Updated 4 years ago
- Convolutional Sparse Coding ☆10 · Jul 18, 2014 · Updated 11 years ago
- High-quality reference implementations of various algorithms for Inverse Reinforcement Learning ☆13 · Jun 20, 2018 · Updated 7 years ago
- ☆10 · Apr 5, 2019 · Updated 6 years ago
- Documentation on how to use NShiftKey ☆12 · Jun 19, 2023 · Updated 2 years ago
- Bayesian scaling laws for in-context learning. ☆15 · Mar 12, 2025 · Updated 11 months ago
- Randomized Linear Algebra in Python ☆13 · Mar 21, 2017 · Updated 8 years ago
- Python library implementing recommender systems algorithms with http://tensorflow.org ☆12 · Dec 21, 2018 · Updated 7 years ago
- Reference implementation of models from Nyonic Model Factory ☆12 · May 13, 2024 · Updated last year
- Sharpened Cosine Distance implementation in PyTorch ☆10 · Feb 1, 2022 · Updated 4 years ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper. ☆12 · Aug 12, 2013 · Updated 12 years ago
- ☆10 · Mar 30, 2025 · Updated 10 months ago
- Deep reinforcement learning + double oracle framework for Robust Restless Bandits ☆10 · Jul 4, 2021 · Updated 4 years ago
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting ☆11 · Jan 9, 2023 · Updated 3 years ago
- JAX implementation of the Mistral 7b v0.1 model ☆13 · Mar 27, 2024 · Updated last year
- A collection of code files for the crawling and work-automation sessions run on YouTube. ☆13 · Aug 15, 2025 · Updated 6 months ago
- Distributed Deep Learning Benchmark Suite ☆11 · Oct 31, 2022 · Updated 3 years ago
- ☆11 · Nov 28, 2022 · Updated 3 years ago
- An automated trading program for Kiwoom Securities. ☆22 · Sep 30, 2023 · Updated 2 years ago
- A convolutional auto-encoder for compressing time sequence data of stocks. ☆12 · Oct 9, 2017 · Updated 8 years ago
- A repo for my miscellaneous codes ☆11 · Dec 21, 2016 · Updated 9 years ago
- Unity ML-Agents Environment for Active Object Tracking with Reinforcement Learning ☆12 · Nov 6, 2020 · Updated 5 years ago
- The architecture used to train the level generator in the game Relay. ☆12 · Apr 8, 2017 · Updated 8 years ago
- Deeplearning4j Examples (DL4J, DL4J Spark, DataVec) ☆10 · Aug 16, 2018 · Updated 7 years ago
- Code and models for the HESS paper on mass conservation and extrapolation with deep learning ☆15 · Nov 11, 2022 · Updated 3 years ago
- ☆13 · Jul 3, 2024 · Updated last year
- Illustration of counterfactual inference following Ferenc Huszar example ☆13 · Aug 15, 2025 · Updated 6 months ago
- Code for recreating the figures in the brainrender paper (Claudi et al. 2020) ☆12 · Dec 11, 2020 · Updated 5 years ago
- This repository contains the PLOD Dataset for Abbreviation Detection released with our LREC 2022 publication ☆12 · Sep 25, 2022 · Updated 3 years ago
- ☆11 · Jan 28, 2019 · Updated 7 years ago
- An open source implementation of CLIP. ☆11 · Mar 26, 2023 · Updated 2 years ago