Train a tic-tac-toe agent using reinforcement learning.
☆76Oct 3, 2025Updated 8 months ago
Alternatives and similar repositories for tictactoe-reinforcement-learning
Users that are interested in tictactoe-reinforcement-learning are comparing it to the libraries listed below.
- Reinforcement learning of the game of Tic Tac Toe in Python☆60Sep 28, 2017Updated 8 years ago
- Implementation of the paper "Dueling Network Architectures for Deep Reinforcement Learning" in PyTorch☆14Sep 26, 2020Updated 5 years ago
- Python port to BlackBerry 10☆14Oct 29, 2012Updated 13 years ago
- A Pac-Man AI with a reinforcement learning agent that uses value iteration, policy iteration, policy extraction, and Q-learning.☆24Mar 10, 2013Updated 13 years ago
- ☆42Jan 19, 2022Updated 4 years ago
- A Python 3 Bandit Visualization Package☆11Oct 16, 2017Updated 8 years ago
- Utensil's LLM Playground (2023)☆10May 24, 2026Updated 3 weeks ago
- A project that uses machine learning and deep learning methods to play Gobang by controlling a mechanical arm☆13Apr 16, 2023Updated 3 years ago
- PyTorch code for TAPAS-GMM.☆15Nov 21, 2024Updated last year
- This repository contains the tutorials and homework assignments for CSCE 790: Neuromorphic computing course at UofSC.☆11Apr 12, 2023Updated 3 years ago
- ☆50Mar 24, 2023Updated 3 years ago
- From Training to Serving: Machine Learning Models with Terraform☆14Jun 7, 2022Updated 4 years ago
- ☆16May 7, 2024Updated 2 years ago
- cadepacote quickly looks up Correios packages from the CLI☆11Sep 8, 2016Updated 9 years ago
- ☆14Mar 7, 2022Updated 4 years ago
- Tutorial files for the JuliaCon 2020 CxxWrap workshop☆14Jul 26, 2020Updated 5 years ago
- Roda & Sequel app for tracking expenses☆16Jun 1, 2025Updated last year
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Oct 26, 2023Updated 2 years ago
- Reinforcement Learning examples implementation and explanation☆344Jul 9, 2024Updated last year
- Course repo for Advanced Machine Learning Course at Linköping University☆18Oct 28, 2025Updated 7 months ago
- A collection of several Deep Reinforcement Learning techniques (Deep Q Learning, Policy Gradients, ...), gets updated over time.☆38Jan 14, 2020Updated 6 years ago
- ☆13Updated this week
- The official source code and datasets for the paper titled "Evaluating ChatGPT as a Recommender System: A Rigorous Approach"☆13Apr 24, 2024Updated 2 years ago
- [ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"☆16Nov 30, 2025Updated 6 months ago
- Simple heat and power model of Germany☆12Jun 27, 2022Updated 3 years ago
- The aim of this project is to learn how to improve the quality of MRI brain images by preprocessing them and prepare the dataset for Mach…☆12Jan 1, 2023Updated 3 years ago
- An experiment with video streaming and webcam streaming as textures with A-Frame☆11Nov 19, 2019Updated 6 years ago
- Set an alarm that will call the given function at the specified time.☆10May 11, 2017Updated 9 years ago
- A Gentle Introduction to Transformers Neural Network☆15Mar 3, 2024Updated 2 years ago
- this is a Hugo continuous delivery site☆17Feb 15, 2021Updated 5 years ago
- Beer Game implemented as an OpenAI gym environment.☆17Aug 4, 2019Updated 6 years ago
- ☆15Sep 9, 2020Updated 5 years ago
- A reinforcement learning agent for tic-tac-toe. Implements the example from Chapter 1 of Sutton and Barto.☆50Jun 6, 2018Updated 8 years ago
- Simulates Rosbot movement in Gazebo and trains a DQN reinforcement learning model.☆20Apr 5, 2022Updated 4 years ago
- ☆15May 27, 2019Updated 7 years ago
- Combined Learning from Demonstration and Motion Planning☆14Feb 5, 2019Updated 7 years ago
- A new model-based algorithm for offline inverse reinforcement learning☆15Feb 20, 2023Updated 3 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- Serverless Apps with AWS☆11Mar 5, 2023Updated 3 years ago