initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
☆189 Updated 4 years ago
Related projects
Alternatives and complementary repositories for AlphaZero_Gomoku_MPI
- A PyTorch-based Gomoku game model, using AlphaZero-style reinforcement learning and Monte Carlo Tree Search. ☆161 Updated 5 years ago
- ☆59 Updated 5 years ago
- AlphaZero implemented for Chinese chess. An AlphaGo Zero / AlphaZero practice project that implements Chinese chess. ☆483 Updated last year
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆69 Updated 6 years ago
- A student implementation of Alpha Go Zero ☆279 Updated 6 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make… ☆16 Updated 6 years ago
- Congratulations to DeepMind! This is a re-engineered implementation (on behalf of many other git repos in /support/) of DeepMind's Oct 19th … ☆341 Updated 2 years ago
- OpenAI Gym Env for game Gomoku (Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...) ☆85 Updated last month
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 Updated 3 years ago
- Implement AlphaZero/AlphaGo Zero methods on Chinese chess. ☆1,089 Updated last year
- Reversi reinforcement learning by AlphaGo Zero methods. ☆677 Updated last year
- Deep learning course project at UCAS ☆37 Updated 5 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆81 Updated last year
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆68 Updated 7 years ago
- An environment of the board game Go using OpenAI's Gym API ☆168 Updated 2 years ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain ☆208 Updated 5 months ago
- C++/Python Fight the Landlord (Dou Dizhu) with pybind11 (a reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020 ☆157 Updated 3 years ago
- An AlphaZero program for Chinese chess ☆378 Updated 5 years ago
- ☆158 Updated last year
- An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row) ☆3,330 Updated 6 months ago
- PyTorch implementation of MuZero ☆343 Updated last year
- Implementation of benchmark RL algorithms ☆459 Updated 2 years ago
- Simple A3C implementation with PyTorch + multiprocessing ☆622 Updated last year
- A parallel framework for population-based multi-agent reinforcement learning. ☆497 Updated 11 months ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games ☆81 Updated 3 weeks ago
- This project is an implementation of AlphaStar ☆187 Updated 10 months ago
- ☆9 Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆112 Updated 3 years ago
- ☆384 Updated 4 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste… ☆88 Updated 6 years ago