airaria / AlphaZero_Gomoku_WuZiQi
My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero
☆10Updated 6 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_WuZiQi:
Users that are interested in AlphaZero_Gomoku_WuZiQi are comparing it to the libraries listed below
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 6 years ago
- A student implementation of Alpha Go Zero☆279Updated 6 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- Series Algorithms of Deep Reinforcement Learning, such as DQN, DDQN, one-step-DQN, DDPG, etc☆41Updated 8 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆191Updated 4 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 8 months ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆17Updated 6 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆89Updated 7 years ago
- 9x9 AlphaGo☆13Updated 8 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- ☆41Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Policy gradient reinforcement learning algorithm with importance sampling☆31Updated 7 years ago
- ☆47Updated last year
- ☆24Updated 4 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Updated 7 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- ☆38Updated 2 years ago
- ☆20Updated 2 years ago
- A simplified version of DeepMind's AlphaGo for playing Connect4☆7Updated 7 years ago
- A reproduction of Alphago Zero in "Mastering the game of Go without human knowledge"☆13Updated 6 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆70Updated 8 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago