zhijs / -Reinforcement-Learning-five-in-a-row-
基于DQN的五子棋人机对弈
☆58Updated 6 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-:
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆164Updated 5 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 7 years ago
- 用深度学习+强化学习编写的一个五子棋人工智障☆39Updated 7 years ago
- alphaGo版本的五子棋(gobang, gomoku)☆67Updated 5 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆201Updated 3 weeks ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆48Updated 6 years ago
- Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程