Reinforcement learning of the game of Tic Tac Toe in Python
☆60Sep 28, 2017Updated 8 years ago
Alternatives and similar repositories for Q-learning-Tic-Tac-Toe
Users that are interested in Q-learning-Tic-Tac-Toe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train a tic-tac-toe agent using reinforcement learning.☆74Oct 3, 2025Updated 5 months ago
- Reference implementation of the HEAT algorithm described in https://link.springer.com/chapter/10.1007/978-3-030-62362-3_4☆11Mar 24, 2023Updated 3 years ago
- Python implementation of TextRank for text document NLP parsing and summarization☆13Feb 28, 2023Updated 3 years ago
- Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…☆20Jun 17, 2020Updated 5 years ago
- thundernet object detection☆11Jul 4, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Mar 23, 2016Updated 10 years ago
- A Python Program to implement Machine Learning for the Game Tic Tac Toe (3x3) using Reinforcement Learning (Q learning technique) and ten…☆14Jul 19, 2017Updated 8 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆29Mar 1, 2024Updated 2 years ago
- R2Plus1D MXNet Implementation☆11Jul 11, 2018Updated 7 years ago
- Programming basics in Python☆10Dec 4, 2016Updated 9 years ago
- Simple Seq2Seq implementation for Keras☆19Mar 19, 2017Updated 9 years ago
- An Android app that uses OpenCL to perform spatial filtering☆20Mar 28, 2013Updated 12 years ago
- A video recommendation system in Python for a cold start, analyzing user behavior and lecture properties of a TunedIt dataset given by Vi…☆13Apr 14, 2019Updated 6 years ago
- ☆16Jul 26, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is the Sylllabus for Siraj Raval's Reinforcement Learning course "AI for Video Games" on Youtube☆64Jan 5, 2018Updated 8 years ago
- ☆11Jun 6, 2021Updated 4 years ago
- A CUDA-accelerated SIFT implementation.☆13Feb 1, 2015Updated 11 years ago
- GAIL implementation using Tensorflow☆14Sep 17, 2019Updated 6 years ago
- GNU M4 is an implementation of the traditional Unix macro processor.☆13Mar 3, 2017Updated 9 years ago
- A tool to dump OpenCL platform/device information☆10Sep 15, 2020Updated 5 years ago
- training food-101 (achieved SOTA top-1 validation acc ~=90%) using 1-cycle-policy:☆15Aug 24, 2019Updated 6 years ago
- ThunderNet detection framework + DlaNet backbone + ShuffleNetV2 lightweight module☆19Sep 17, 2020Updated 5 years ago
- ☆13Apr 10, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An implementation for CVRP problem with A3C+Attention mechanism and GCN☆18May 17, 2020Updated 5 years ago
- ☆15Sep 17, 2016Updated 9 years ago
- leetcode题解 C++高性能版 (运行时长打败95%+) VSCode+CMake+Catch2☆11Sep 7, 2025Updated 6 months ago
- Some stereo algorithms.☆13May 23, 2016Updated 9 years ago
- A library for lightweight SLAM for 3D scanning of small objects from a webcam/mobile phone☆13Jun 9, 2017Updated 8 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding☆10Jan 5, 2026Updated 2 months ago
- PyTorch implementation of "Asynchronous advantage actor-critic"☆19Oct 30, 2025Updated 4 months ago
- TensorFlow Implementation of Deep3D+, a VGG19 expanded "deconvolution" network for doing depth estimation and in-painting for stereoscopi…☆15Sep 15, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Monocular disparity (inverse depth) estimation network☆12Aug 23, 2017Updated 8 years ago
- RecResNet: A Recurrent Residual CNN Architecture for Disparity Map Enhancement. 3DV 2018☆22Oct 4, 2018Updated 7 years ago
- Deep Reinforcement Learning methods for facilitating Automated Stock Trading☆21May 22, 2021Updated 4 years ago
- Matches audio to small vocabulary using fast fourier transforms☆15Jan 25, 2015Updated 11 years ago
- A program that times various techniques for performing a moving median filter (sometimes called rolling median, or streaming median)☆11Feb 13, 2016Updated 10 years ago
- A Latex template for the NIT Trichy B.Tech (And others) thesis☆11May 11, 2016Updated 9 years ago
- Solutions to all Meta/Facebook puzzles available on Meta's careers website (solutions in multiple languages)☆12May 23, 2025Updated 10 months ago