zhijs / -Reinforcement-Learning-five-in-a-row-
DQN-based human-vs-computer Gomoku (five-in-a-row)
☆58 Updated 6 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-:
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below
- A PyTorch-based Gomoku game model: an AlphaZero-based reinforcement learning and Monte Carlo Tree Search model. ☆166 Updated 6 years ago
- A Gomoku "artificial idiot" written with deep learning + reinforcement learning ☆41 Updated 7 years ago
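- Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, mainly a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code + tutorial ☆96 Updated 2 years ago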
- An AlphaGo-style Gomoku (gobang, gomoku) ☆68 Updated 5 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆204 Updated last month
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆75 Updated 7 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network. ☆45 Updated 5 years ago
- ☆22 Updated 7 years ago
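- A Gomoku-playing AI built with game-tree search and artificial neural networks. 171129: a new version based on RL training is planned, expected to be completed in January 2018 ☆116 Updated 6 years ago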
- Deep Learning major course assignment at UCAS ☆37 Updated 6 years ago
- Build your own Chinese chess AI with the AlphaZero algorithm ☆258 Updated 2 years ago
- Chinese chess implemented with AlphaZero. A hands-on AlphaGo Zero / AlphaZero project that implements Chinese chess. ☆507 Updated last year
- Play Flappy Bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆67 Updated 7 years ago
- My reinforcement learning notes and study materials; still updating ... ☆345 Updated 5 years ago
- Gomoku based on reinforcement learning ☆11 Updated 6 years ago
- Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510 : playing Flappy Bird with a DQN model on PyTorch, as an experimental example of reinforcement learning. ☆47 Updated 6 years ago
- A Gomoku AI based on game-tree search with α-β pruning ☆735 Updated 7 years ago
- Dou Dizhu (doudizhu) AI trained with reinforcement learning. ☆15 Updated 5 years ago
- OpenAI Gym Env for the game Gomoku (Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...) ☆88 Updated 6 months ago
- A Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE ☆51 Updated 7 years ago
- C++/Python fight the lord with pybind11 (reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020 ☆160 Updated 3 years ago
- ☆224 Updated 5 years ago
- ☆61 Updated 6 years ago
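- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》 ☆258 Updated 6 years ago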
- A Chinese Chess program and an accompanying AI based on Monte Carlo Tree Search and a neural network (like AlphaGo)… ☆112 Updated 6 years ago
- A gobang AI with Negamax and alpha-beta pruning ☆13 Updated 2 years ago
- SuperMario A3C Trainer for Windows ☆33 Updated 6 years ago
- Playing Super Mario with reinforcement learning ☆72 Updated 2 years ago
- A Gomoku game made with Python ☆140 Updated 9 months ago
- Implementations of classic reinforcement learning algorithms (offline/online learning, Q-learning, DQN) on the pole-balancing game and several Atari games (CartPole/Pong/Boxing/MsPacman) ☆29 Updated 6 years ago