Reinforcement learning of the game of Tic Tac Toe in Python
☆60Sep 28, 2017Updated 8 years ago
Alternatives and similar repositories for Q-learning-Tic-Tac-Toe
Users that are interested in Q-learning-Tic-Tac-Toe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train a tic-tac-toe agent using reinforcement learning.☆73Oct 3, 2025Updated 6 months ago
- OpenAI Gym Style Tic-Tac-Toe Environment☆74Mar 11, 2021Updated 5 years ago
- Reference implementation of the HEAT algorithm described in https://link.springer.com/chapter/10.1007/978-3-030-62362-3_4☆11Mar 24, 2023Updated 3 years ago
- ☆15Apr 10, 2017Updated 9 years ago
- Coursera Deep Learning Specialization: Code Implementation, Lecture Notes and Corresponding Papers☆10Oct 8, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…☆20Jun 17, 2020Updated 5 years ago
- Tutorial files for the JuliaCon 2020 CxxWrap workshop☆14Jul 26, 2020Updated 5 years ago
- a pacman AI with a reinforcement learning agent that utilizes value iteration, policy iteration, policy extraction, Q-learning.☆24Mar 10, 2013Updated 13 years ago
- This is chat bot which is based on term frequency and inverse document frequency and uses cosine similarity to calculate the same.☆14Mar 26, 2018Updated 8 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- Generative Adversarial Network☆15Oct 12, 2018Updated 7 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- A checkers reinforcement learning AI, and all the tools needed to train it.☆59May 30, 2020Updated 5 years ago
- <앵귤러 마스터북>☆18Jul 15, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Program to simulate admixture between multiple populations☆19Mar 8, 2023Updated 3 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- Resources related to the JuliaGPU GitLab CI.☆25Nov 10, 2020Updated 5 years ago
- Parallel on the ROCks☆18Aug 13, 2020Updated 5 years ago
- Repo for experiments on pyspark and sklearn☆79Feb 19, 2014Updated 12 years ago
- Notebooks on fitting mixed-effects models in Julia☆25Nov 3, 2017Updated 8 years ago
- Efficient Computation and Analysis of Distributional Shapley Values (AISTATS 2021)☆22Oct 19, 2023Updated 2 years ago
- Matches audio to small vocabulary using fast fourier transforms☆15Jan 25, 2015Updated 11 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A simple cache that can hold anything, including Swift items☆13Jan 31, 2017Updated 9 years ago
- QWOP AI using Q-learning☆12Jul 13, 2016Updated 9 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆93Jun 21, 2017Updated 8 years ago
- WIP — OpenSwiftUI is an OpenSource implementation of Apple's SwiftUI DSL.☆10Feb 28, 2020Updated 6 years ago
- Code for "Gradient descent GAN optimization is locally stable"☆23Nov 5, 2017Updated 8 years ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- Exploring Automatic Differentiation with Racket☆12Jan 9, 2022Updated 4 years ago
- GPU Acceleration for Apache Spark☆34Aug 24, 2015Updated 10 years ago
- Hierarchical state machine framework in Swift.☆11Nov 2, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A (incomplete) terminal Tetris. Written in Haskell.☆27Jan 18, 2018Updated 8 years ago
- 2048 Reinforcement Learning☆54Jun 17, 2018Updated 7 years ago
- A simple library for SwiftUI to write more structured view module by decouple viewmodifiers☆17Nov 4, 2020Updated 5 years ago
- Julia code in Kenneth Lange's Algorithms from THE BOOK☆31Apr 28, 2025Updated 11 months ago
- Jupyter Notebooks for Learning CS Foundational Concepts using C++☆33Apr 6, 2026Updated last week
- A unix-style utility for responding programmatically to new ethernet devices joining a network☆14Nov 11, 2018Updated 7 years ago
- Port of Scala/Haskell Refined library to Idris☆17Apr 25, 2021Updated 4 years ago