Solves the Tower of Hanoi puzzle by Q-learning
☆28 · Nov 8, 2017 · Updated 8 years ago
Alternatives and similar repositories for Q-learning-Hanoi
Users that are interested in Q-learning-Hanoi are comparing it to the libraries listed below.
- Dynamic Power Management using Reinforcement Learning for IoT devices. ☆11 · Oct 23, 2021 · Updated 4 years ago
- Temporal Difference Learning and Basic Reinforcement Learning Demos in Matlab ☆16 · Jul 27, 2016 · Updated 9 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023) ☆10 · Dec 12, 2023 · Updated 2 years ago
- flexible meta-learning in jax ☆16 · Oct 19, 2023 · Updated 2 years ago
- JAX implementation of the Mistral 7b v0.1 model ☆13 · Mar 27, 2024 · Updated 2 years ago
- Formal Language Tools for Robots ☆15 · Jun 29, 2016 · Updated 9 years ago
- Reimplementation of ToMNet with some extensions for RL as well ☆14 · Apr 28, 2018 · Updated 8 years ago
- Neural model of hierarchical reinforcement learning ☆16 · Sep 14, 2017 · Updated 8 years ago
- ☆18 · Apr 17, 2026 · Updated 3 weeks ago
- This repo contains active learning query strategies as introduced in our GCPR 2013 paper. ☆12 · Aug 12, 2013 · Updated 12 years ago
- A2C training of Relational Deep Reinforcement Learning Architecture ☆13 · Jun 22, 2022 · Updated 3 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021) ☆12 · Nov 30, 2021 · Updated 4 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks" ☆13 · Jul 19, 2021 · Updated 4 years ago
- A proof-of-concept parser for the Prolog programming language, at the Bern University of Applied Sciences for the course "Automata and fo… ☆11 · Jan 17, 2014 · Updated 12 years ago
- CACHECA is a cache language model based code suggestion tool. ☆12 · Feb 11, 2016 · Updated 10 years ago
- PyData San Luis 2017 Tutorial: An Introduction to Gaussian Processes in PyMC3 ☆15 · Nov 16, 2017 · Updated 8 years ago
- Hands-On Q-Learning with Python, published by Packt ☆29 · Jan 30, 2023 · Updated 3 years ago
- Learn to build neural networks from scratch, simply. No autograd, no deep learning libraries - just numpy. ☆10 · Aug 10, 2022 · Updated 3 years ago
- ☆10 · Dec 12, 2017 · Updated 8 years ago
- My PhD thesis (in progress!) ☆15 · Oct 23, 2016 · Updated 9 years ago
- Differentiable Augmentation for Data-Efficient GAN Training ☆11 · Aug 9, 2020 · Updated 5 years ago
- An example repository demonstrating Bazel cc_binary and cc_library build targets. ☆11 · Mar 3, 2016 · Updated 10 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs ☆22 · Apr 24, 2025 · Updated last year
- Drop-in environment replacements that make your RL algorithm train faster. ☆22 · Jun 19, 2024 · Updated last year
- A python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th… ☆17 · Jan 17, 2024 · Updated 2 years ago
- PyTorch implementation of linear and convolutional layers with fixed, random feedback weights. ☆15 · Mar 14, 2021 · Updated 5 years ago
- Deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and… ☆15 · Sep 12, 2018 · Updated 7 years ago
- a library for deep reinforcement learning, with applications for navigation ☆16 · Feb 6, 2018 · Updated 8 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch ☆41 · Dec 25, 2018 · Updated 7 years ago
- Karras et al. (2022) diffusion models for PyTorch ☆11 · Aug 23, 2022 · Updated 3 years ago
- Numenta's experimental C++ research code. Please see htmresearch for more details. ☆27 · Jul 26, 2019 · Updated 6 years ago
- ☆22 · Sep 19, 2023 · Updated 2 years ago
- RuDaS: Synthetic Datasets for Rule Learning ☆20 · Jun 21, 2022 · Updated 3 years ago
- WikiQA: a reproduction of the paper "APPLYING DEEP LEARNING TO ANSWER SELECTION: A STUDY AND AN OPEN TASK" ☆29 · Jul 25, 2019 · Updated 6 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment ☆15 · Jul 1, 2018 · Updated 7 years ago
- Code for recreating the figures in the brainrender paper (Claudi et al. 2020) ☆12 · Dec 11, 2020 · Updated 5 years ago
- Python implementation of the main algorithms of the Learning From Interpretation Transitions (LFIT) framework ☆17 · Apr 28, 2026 · Updated last week
- A simple tutorial of angularjs with flask ☆17 · Feb 1, 2014 · Updated 12 years ago
- Sample notebooks for Juno ☆11 · Mar 1, 2025 · Updated last year