airaria / AlphaZero_Gomoku_WuZiQi
My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero
☆10Updated 7 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_WuZiQi
Users who are interested in AlphaZero_Gomoku_WuZiQi are comparing it to the libraries listed below
- Reinforcement learning algorithms to play Poker☆14Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- This is the code for "How to Beat Pong Using Policy Gradients (LIVE)" by Siraj Raval on YouTube☆70Updated 8 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆43Updated 8 years ago
- ☆15Updated 9 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆58Updated last year
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play☆25Updated 6 years ago
- ☆19Updated 2 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Updated 6 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆17Updated 6 years ago
- Open AI gym environment for the game 2048☆73Updated 3 years ago
- ☆13Updated 3 years ago
- A Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆51Updated 7 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆32Updated 7 years ago
- A C++ pytorch implementation of MuZero☆38Updated last year
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆210Updated 3 months ago
- ☆69Updated 6 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- ☆45Updated 2 years ago
- A Policy Network in Tensorflow to classify chess moves☆18Updated 8 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…☆90Updated 7 years ago
- Paper list in the area of reinforcement learning for recommendation systems☆24Updated 4 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆90Updated 7 years ago
- ☆9Updated 6 years ago
- ☆18Updated 6 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Updated 7 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆80Updated 6 years ago