airaria / AlphaZero_Gomoku_WuZiQi
My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero
☆10Updated 6 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_WuZiQi:
Users who are interested in AlphaZero_Gomoku_WuZiQi are comparing it to the libraries listed below.
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆196Updated 5 years ago
- Monte Carlo Counterfactual Regret Minimization for imperfect information games☆13Updated 5 years ago
- ☆20Updated 2 years ago
- A Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆50Updated 7 years ago
- ☆41Updated 3 years ago
- RL library based on algorithms from the book <A-introduction-to-reinforcement-learning>☆90Updated 7 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- A Policy Network in Tensorflow to classify chess moves☆17Updated 8 years ago
- Reinforcement Learning and Transfer Learning based StarCraft Micromanagement☆102Updated 7 years ago
- ☆66Updated 3 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Unofficial attempt to rebuild AlphaGo Zero☆56Updated 9 months ago
- ☆60Updated 6 years ago
- A simplified version of DeepMind's AlphaGo for playing Connect4☆7Updated 7 years ago
- Game AI Explorer☆16Updated 6 years ago
- Board game AI implementations using Monte Carlo Tree Search☆183Updated 4 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆69Updated 8 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- A Renju game that replicates the paper "Mastering the game of Go with deep neural networks and tree search"☆20Updated 8 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Updated 7 years ago
- RainBow, Tensorflow☆49Updated 6 years ago
- ☆15Updated 8 years ago
- ☆40Updated 2 years ago
- Collection of Deep Reinforcement Learning algorithms☆124Updated 7 years ago
- ☆53Updated 8 years ago