junxiaosong / AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
☆3,449 Updated 11 months ago
Alternatives and similar repositories for AlphaZero_Gomoku:
Users who are interested in AlphaZero_Gomoku are comparing it to the repositories listed below.
- Implement AlphaZero/AlphaGo Zero methods on Chinese chess. ☆1,127 Updated 2 years ago
- AlphaZero implemented for Chinese chess. An AlphaGo Zero / AlphaZero practice project that implements Chinese chess. ☆502 Updated last year
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku ☆201 Updated 3 weeks ago
- Reversi reinforcement learning by AlphaGo Zero methods. ☆678 Updated 2 years ago
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more ☆4,069 Updated 2 months ago
- A PyTorch-based Gomoku game model, built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search. ☆164 Updated 5 years ago
- ☆61 Updated 6 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆74 Updated 6 years ago
- Congratulations to DeepMind! This is a reengineering implementation (on behalf of many other git repos in /support/) of DeepMind's Oct19th … ☆342 Updated 2 years ago
- Go AI program which implements the AlphaGo Zero paper ☆2,890 Updated 6 years ago
- Build your own Chinese chess AI with the AlphaZero algorithm ☆250 Updated 2 years ago
- A student implementation of Alpha Go Zero ☆280 Updated 6 years ago
- An AlphaZero program for Chinese chess ☆395 Updated 6 years ago
- ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation ☆3,386 Updated 5 years ago
- Chess reinforcement learning by AlphaGo Zero methods. ☆2,160 Updated 2 years ago
- Playing Flappy Bird Using Deep Reinforcement Learning (Based on Deep Q Learning DQN using Tensorflow) ☆582 Updated 4 years ago
- Monte Carlo tree search in Python ☆598 Updated 2 years ago
- A replica of the AlphaZero methodology for deep reinforcement learning in Python ☆2,031 Updated 2 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". ☆1,257 Updated 5 years ago
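- Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, intended mainly as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code + tutorial ☆92 Updated 2 years ago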
- Replicating AlphaGo's architecture in a readable manner ☆1,156 Updated 4 years ago
- A relatively strong gomoku AI ☆87 Updated 2 years ago
- Reinforcement learning resources compiled in Chinese ☆2,001 Updated 4 years ago
- An open-source implementation of the AlphaGoZero algorithm ☆3,486 Updated 4 years ago
- An End-To-End, Lightweight and Flexible Platform for Game Research ☆2,089 Updated 3 years ago
- Human-vs-computer Gomoku based on DQN ☆58 Updated 6 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network. ☆44 Updated 4 years ago
- Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning ☆2,522 Updated 5 years ago
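- This project uses Monte Carlo tree search and a residual neural network to build a Gomoku AI that can reach fairly strong play after short training on small-scale hardware. An implementation for Gomoku, following the original AlphaGo Zero paper "Mastering the game of Go without human knowledge", … ☆38 Updated 2 years ago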
- ☆390 Updated 4 years ago