Zeta36 / connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
☆114Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for connect4-alpha-zero
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 6 months ago
- Reversi reinforcement learning by AlphaGo Zero methods.☆677Updated last year
- A student implementation of Alpha Go Zero☆279Updated 6 years ago
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…☆88Updated 6 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆267Updated 5 years ago
- An implementation of the AlphaZero algorithm for chess☆34Updated last year
- Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th …☆341Updated 2 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated last year
- Chess position evaluation using neural networks☆25Updated 4 years ago
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆82Updated 4 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆189Updated 4 years ago
- Demo of UCT (MCTS) in Python / Numpy☆83Updated last year
- A structured implementation of MuZero☆206Updated 2 years ago
- Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow☆74Updated last year
- ☆65Updated 3 years ago
- Minimalistic AlphaGoZero-like Engine☆274Updated 6 years ago
- An implementation of Monte Carlo Tree Search in python☆162Updated 4 years ago
- This is the code for "How Does DeepMind's AlphaGo Zero Work?" Siraj Raval on Youtube☆122Updated 7 years ago
- Chess reinforcement learning by AlphaZero methods.☆38Updated 6 years ago
- MCTS project for Tetris☆342Updated last month
- Implementation of TD-Gammon in TensorFlow.☆110Updated 5 years ago
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆99Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆297Updated 5 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆85Updated last month
- ICML 2018 Self-Imitation Learning☆276Updated 4 years ago
- Pytorch Implementation of MuZero☆343Updated last year
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆360Updated 4 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆95Updated 5 years ago