airaria / AlphaZero_Gomoku_WuZiQi
My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero
☆10 Updated 7 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_WuZiQi:
Users that are interested in AlphaZero_Gomoku_WuZiQi are comparing it to the libraries listed below
- Reinforcement learning algorithms to play Poker ☆15 Updated 3 years ago
- Unofficial attempt to rebuild AlphaGo Zero ☆58 Updated 11 months ago
- A simplified version of DeepMind's AlphaGo for playing Connect4 ☆7 Updated 7 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 Updated 4 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆81 Updated 2 years ago
- 9x9 AlphaGo ☆13 Updated 8 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 Updated 7 years ago
- ☆61 Updated 6 years ago
- AlphaGo Zero paper and code for study purposes ☆28 Updated 7 years ago
- A student implementation of AlphaGo Zero ☆280 Updated 6 years ago
- Monte Carlo Counterfactual Regret Minimization for imperfect information games ☆13 Updated 6 years ago
- A series of deep reinforcement learning algorithms, such as DQN, DDQN, one-step-DQN, DDPG, etc. ☆43 Updated 8 years ago
- Congratulations to DeepMind! This is a reengineering implementation (on behalf of many other git repos in /support/) of DeepMind's Oct 19th … ☆343 Updated 2 years ago
- A Policy Network in TensorFlow to classify chess moves ☆17 Updated 8 years ago
- ☆19 Updated 2 years ago
- Game AI Explorer ☆16 Updated 6 years ago
- An implementation of the AlphaZero algorithm for chess ☆33 Updated 2 years ago
- ☆30 Updated 6 years ago
- OpenAI Gym environment for the game 2048 ☆73 Updated 3 years ago
- A Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE ☆51 Updated 7 years ago
- A2C, ACKTR and A2T implementations for ViZDoom ☆10 Updated 7 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆205 Updated 2 months ago
- Deep reinforcement learning agents implemented in TensorFlow https://ghli.org ☆53 Updated 5 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei… ☆35 Updated 6 years ago
- Deep learning course project at UCAS ☆37 Updated 6 years ago
- A reproduction of AlphaGo Zero in "Mastering the game of Go without human knowledge" ☆13 Updated 7 years ago
- ☆42 Updated 3 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆56 Updated 6 years ago
- ☆25 Updated 4 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku ☆15 Updated last year