Implementation of the AlphaZero algorithm for playing the simple board game Gomoku
☆14 · May 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for AlphaPig
Users that are interested in AlphaPig are comparing it to the libraries listed below
- Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg. ☆33 · Dec 14, 2018 · Updated 7 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆78 · Apr 16, 2018 · Updated 7 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search" ☆20 · Jun 29, 2016 · Updated 9 years ago
- ☆14 · Apr 14, 2025 · Updated 10 months ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019. ☆10 · Jan 10, 2019 · Updated 7 years ago
- Run perfetto with Docker and docker-compose (self signed certificates) ☆11 · Feb 1, 2023 · Updated 3 years ago
- Mac port of Torcs, The Open Racing Car Simulator ☆11 · Jun 16, 2010 · Updated 15 years ago
- [AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction". ☆21 · Jul 26, 2025 · Updated 7 months ago
- Hands-On TensorBoard for PyTorch Developers, Published by Packt ☆11 · Dec 15, 2025 · Updated 2 months ago
- Recommendation engine and its algorithms in Python, R. ☆12 · Oct 26, 2018 · Updated 7 years ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie… ☆10 · Feb 7, 2026 · Updated 3 weeks ago
- Fast interpolative decompositions in Python ☆10 · Jan 4, 2021 · Updated 5 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion. ☆11 · Nov 30, 2018 · Updated 7 years ago
- DQN with a frozen target network in TensorFlow on pygame FlappyBird ☆11 · Dec 19, 2018 · Updated 7 years ago
- Repository for Shen Group's FMAB ☆11 · Jan 21, 2021 · Updated 5 years ago
- An attempt to apply reinforcement learning to the graph signal recovery problem ☆11 · Aug 25, 2021 · Updated 4 years ago
- C# GUI Gomoku course project, including forbidden moves, AI, and a simple live-streaming feature ☆10 · Dec 14, 2018 · Updated 7 years ago
- ☆11 · Dec 26, 2017 · Updated 8 years ago
- Connect6 (Korean: 육목) for Python. ☆11 · May 15, 2017 · Updated 8 years ago
- Deep Learning course project at UCAS ☆39 · Jan 8, 2019 · Updated 7 years ago
- Renju mate solver compilable to WebAssembly ☆11 · Mar 10, 2024 · Updated last year
- TensorFlow implementation of CapsNet ☆10 · Apr 3, 2020 · Updated 5 years ago
- An implementation of the Jenkins Traub polynomial root finding algorithm ☆14 · Aug 23, 2015 · Updated 10 years ago
- Simulated Annealing for MAX-CUT problems on {+1,-1}-weighted complete graphs ☆12 · Feb 2, 2019 · Updated 7 years ago
- Neural network training code for Gomoku/Renju AI ☆13 · Feb 10, 2026 · Updated 3 weeks ago
- WeChat mini program for learning the pinyin (拼音) alphabet, built with Taro.js. ☆12 · Jul 7, 2023 · Updated 2 years ago
- Ultralightweight JSON parser in ANSI C ☆10 · Mar 8, 2017 · Updated 8 years ago
- A gomoku AI based on Alpha Zero paper. ☆12 · May 1, 2023 · Updated 2 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆15 · Feb 8, 2026 · Updated 3 weeks ago
- UCT tree search + supervised learning for Atari games ☆12 · Feb 14, 2017 · Updated 9 years ago
- small language models training made easy ☆13 · Dec 15, 2024 · Updated last year
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks" ☆14 · Sep 12, 2022 · Updated 3 years ago
- Reinforcement learning on gridworld with Q-learning ☆10 · Jan 28, 2017 · Updated 9 years ago
- OpenCLDemo for Redmi Note 4X (nikel, MTK), Nexus 5, Nexus 6p and Pixel 2 ☆13 · Apr 14, 2018 · Updated 7 years ago
- Short text similarity matching model based on deep learning and machine learning ☆15 · Jan 9, 2019 · Updated 7 years ago
- Implementation of SENets in Chainer (Squeeze-and-Excitation Networks: https://arxiv.org/abs/1709.01507) ☆15 · Sep 15, 2017 · Updated 8 years ago
- ☆13 · Sep 14, 2021 · Updated 4 years ago
- ☆13 · May 26, 2016 · Updated 9 years ago
- Deep Reinforcement Learning that makes you smile ☆16 · Jul 6, 2017 · Updated 8 years ago