Solves the Tower of Hanoi puzzle by Q-learning
☆28Nov 8, 2017Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below.
- This repository provides code for "Polyp Segmentation via Semantic Enhanced Perceptual Network" IEEE TCSVT-2024.☆15Mar 31, 2026Updated 2 weeks ago
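- For underwater image and video restoration and enhancement; reproduces a CVPR 2012 paper: a fusion strategy for enhancing underwater images and videos. The code uses Python, …☆15Oct 13, 2024Updated last year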
- Temporal Difference Learning and Basic Reinforcement Learning Demos in Matlab☆16Jul 27, 2016Updated 9 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- flexible meta-learning in jax☆16Oct 19, 2023Updated 2 years ago
- A deep convolutional neural network for Semantic Segmentation of Forest Fire Images☆14Jun 2, 2023Updated 2 years ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 7 years ago
- Repository for UMD CS Course: Introduction to Data Science I: Preparing, Storing, and Manipulating Data☆17Dec 13, 2014Updated 11 years ago
- [ICASSP 2025] Underwater Image Restoration via Polymorphic Large Kernel CNNs☆24Apr 22, 2025Updated 11 months ago
- Neural model of hierarchical reinforcement learning☆16Sep 14, 2017Updated 8 years ago
- Q-Learning applied to the classic Travelling Salesman Problem☆19Apr 6, 2017Updated 9 years ago
- ☆18Mar 18, 2026Updated last month
- Illustration of counterfactual inference following Ferenc Huszar example☆13Aug 15, 2025Updated 8 months ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper.☆12Aug 12, 2013Updated 12 years ago
- This library provides expression trees for representation of geometric expressions and automatic differentiation of these expressions. Th…☆14Aug 24, 2023Updated 2 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture☆13Jun 22, 2022Updated 3 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- PyData San Luis 2017 Tutorial: An Introduction to Gaussian Processes in PyMC3☆15Nov 16, 2017Updated 8 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
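- Web front-end final project: a front-end website themed on China's space program, covering the development of China's spaceflight, space achievements, space knowledge, space news, astronaut profiles, and space products. Built with HTML, CSS, and JavaScript. The site includes a main page, subpages, and a form page.☆25Feb 3, 2023Updated 3 years ago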
- ☆11Nov 28, 2022Updated 3 years ago
- ☆10Dec 12, 2017Updated 8 years ago
- My PhD thesis (in progress!)☆14Oct 23, 2016Updated 9 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆21Oct 19, 2025Updated 6 months ago
- A Towers of Hanoi environment in OpenAI Gym Style☆14Jun 6, 2019Updated 6 years ago
- An example repository demonstrating Bazel cc_binary and cc_library build targets.☆11Mar 3, 2016Updated 10 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- An AI-powered tool for transcribing, summarizing, and creating smart clips from video and audio content.☆57Sep 21, 2024Updated last year
- PyTorch implementation of linear and convolutional layers with fixed, random feedback weights.☆15Mar 14, 2021Updated 5 years ago
- A convolutional auto-encoder for compressing time sequence data of stocks.☆12Oct 9, 2017Updated 8 years ago
- Emotion recognition from EEG using the HOS method☆10Dec 29, 2021Updated 4 years ago
- CUDA extension for the SPORCO project☆18Jul 5, 2021Updated 4 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆41Dec 25, 2018Updated 7 years ago
- just a neater version of PointNet and PointNet++ in tensorflow☆13May 3, 2018Updated 7 years ago
- [IEEE TMI 2024] MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images☆47Dec 8, 2025Updated 4 months ago
- Karras et al. (2022) diffusion models for PyTorch☆11Aug 23, 2022Updated 3 years ago