Reinforcement learning on gridworld with Q-learning
☆10Jan 28, 2017Updated 9 years ago
Alternatives and similar repositories for Q-learning-gridworld
Users that are interested in Q-learning-gridworld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scripts and applications for autonomously controlling the Crazyflie using camera/Kinect on a host☆12Feb 16, 2022Updated 4 years ago
- Cuda implementation of semi global block matching for stereo.☆12Aug 12, 2021Updated 4 years ago
- Experimentation with Streamlit for personal LLM tool☆15Jun 19, 2023Updated 2 years ago
- Published by Packt☆11Jan 18, 2021Updated 5 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simple grid-world environment compatible with OpenAI-gym☆50Mar 19, 2020Updated 6 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Eticas AI library to help with audits☆11Apr 16, 2025Updated last year
- Robust Optimal Control for Flight Planning☆14May 12, 2025Updated last year
- MetaArcade is a configurable environment suite for meta-learning☆16Oct 19, 2022Updated 3 years ago
- Non stationary bandit for experiments with Reinforcement Learning☆33Mar 24, 2017Updated 9 years ago
- ROBEL: Robotics Benchmarks for Learning with low-cost robots (dev fork)☆13Jul 30, 2020Updated 5 years ago
- ☆12Dec 8, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Cheminformatics tools that work natively with Google tools such as Sheets and BigQuery☆17Jul 12, 2024Updated last year
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- ☆12Jun 24, 2021Updated 4 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- Semi-supervised Latent Dirichlet Allocation (LDA)☆12Dec 21, 2017Updated 8 years ago
- A fast and robust algorithm for temporal difference learning☆23Mar 16, 2026Updated 2 months ago
- Source Code for 'Beginning Game Programming with Pygame Zero: Coding Interactive Games on Raspberry Pi Using Python' by Stewart Watkiss☆15Aug 1, 2020Updated 5 years ago
- Automated Steel Bar Counting and Center Localization with Convolutional Neural Networks☆34May 16, 2019Updated 7 years ago
- Website Templates index page☆12Aug 28, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Temporal Difference Learning based Backgammon game using Neural Network based model☆11Mar 13, 2018Updated 8 years ago
- ☆15Apr 14, 2025Updated last year
- ☆33Nov 21, 2022Updated 3 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆17Jan 6, 2018Updated 8 years ago
- detecting anomalies in hyper suprime-cam images with generative adversarial networks☆11Aug 3, 2021Updated 4 years ago
- An opensource implementation of kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated 2 months ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt☆11Dec 15, 2025Updated 5 months ago
- Boids are a way of modeling the complex flocking behavior of birds as well as many marine life including schools of fish; the simple rule…☆20Dec 31, 2019Updated 6 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- ☆17Feb 21, 2020Updated 6 years ago
- A StarCraft 2 agent for harvesting resources☆13Jun 12, 2018Updated 7 years ago
- HTML5 canvas based image editor for the web and ChromeOS☆12Apr 8, 2020Updated 6 years ago
- ☆10Jul 14, 2018Updated 7 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- ☆10May 11, 2024Updated 2 years ago