zhijs / -Reinforcement-Learning-five-in-a-row-

Human-versus-computer Gomoku (five-in-a-row) based on DQN

☆59

Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-

Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below

pandezhao / alpha_sigma
A PyTorch-based Gomoku game model, built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search.
☆165 Updated 6 years ago
initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
☆209 Updated 5 months ago
xmfbit / DQN-FlappyBird
Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch
☆69 Updated 8 years ago
buyulian / Five-Chess-DQN
A Gomoku "artificial un-intelligence" written with deep learning and reinforcement learning
☆42 Updated 7 years ago
gaoxiaos / Supermariobros-PPO-pytorch
Reinforcement learning on super-mario-bros
☆54 Updated 4 years ago
GuoYi0 / alphaFive
An AlphaGo-style version of five-in-a-row (gobang, gomoku)
☆68 Updated 5 years ago
chengstone / cchess-zero
Chinese chess implemented with AlphaZero. A hands-on AlphaGo Zero / AlphaZero project.
☆517 Updated last year
SukerZ / Playing-Flappy-Bird-by-DQN-on-PyTorch
Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: playing Flappy Bird with a DQN model on PyTorch, as an experimental example of reinforcement learning.
☆48 Updated 6 years ago
zhongjn / gomokuer
A tiny re-implementation of AlphaGo Zero (in Gomoku)
☆76 Updated 7 years ago
YoujiaZhang / AlphaGo-Zero-Gobang
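AlphaGo-Zero-Gobang is a reinforcement-learning-based Gomoku (Gobang) model. It is a demo meant mainly for understanding how AlphaGo Zero works: how the neural network guides MCTS decisions and how the model learns through self-play. Source code plus tutorial.
☆104 Updated 2 months ago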
Sharpiless / play-daxigua-using-Reinforcement-Learning
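Trains an AI model with the DQN reinforcement learning algorithm to play the watermelon-merging game 合成大西瓜 (daxigua); Keras and PARL (Paddle) versions are provided
☆91 Updated 4 years ago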
zouyih / AlphaZero_Gomoku-tensorflow
☆62 Updated 6 years ago
applenob / rl_learn
My reinforcement learning notes and study materials; still updating ...
☆351 Updated 6 years ago
YangRui2015 / 2048_env
2048 environment for Reinforcement Learning and DQN algorithm
☆40 Updated 3 years ago
zhuliquan / reinforcement_learning_basic_book
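A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》
☆263 Updated 6 years ago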
apachecn / stanford-cs234-notes-zh
Chinese lecture notes for Stanford CS234 (reinforcement learning)
☆204 Updated 4 years ago
npnpwqf / Renju
Gomoku based on reinforcement learning
☆11 Updated 6 years ago
zhangbincheng1997 / expert-system
Expert systems coursework: tic-tac-toe, inference engine, decision tree
☆47 Updated 6 years ago
PiperLiu / Amazing-Brick-DFS-and-DRL
Automatically controlling the amazing brick mini-game with depth-first search (DFS) and with deep reinforcement learning (DRL)
☆52 Updated last year
tinyzqh / awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning (updating)
☆274 Updated 4 years ago
PiperLiu / Reinforcement-Learning-practice-zh
Reinforcement learning: Chinese notes and resources, centered on Python examples, progressing from basic to advanced
☆105 Updated 4 years ago
qq456cvb / doudizhu-C
C++/Python Fight the Landlord (Dou Dizhu) with pybind11 (a reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020
☆161 Updated 4 years ago
thu-ml / tianshou-docs-zh_CN
Chinese documentation for Tianshou
☆58 Updated 7 months ago
xiaohaomao / Reinforcment-Leanring-algorithm
Implementations of classic reinforcement learning algorithms (offline/online learning, Q-learning, DQN) on the cart-pole balancing game and several Atari games (CartPole/Pong/Boxing/MsPacman)
☆31 Updated 7 years ago
tensorfly-gpu / aichess
Build your own Chinese chess AI using the AlphaZero algorithm
☆276 Updated 2 years ago
cstrikest / DRL_Snakey
A deep reinforcement learning Snake game, with a complete game environment and AI interface. (Project unfinished.)
☆40 Updated 6 years ago
xshura / reinforcement_learning
Reinforcement learning
☆66 Updated 6 years ago
charleschen003 / doudizhu-rl
Dou Dizhu (doudizhu) AI trained with reinforcement learning.
☆16 Updated 5 years ago
cloxnu / Omega_Gomoku_AI
♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network.
☆50 Updated 5 years ago
hangsz / reinforcement_learning
The [动手学强化学习] (Hands-on Reinforcement Learning) series, based on PyTorch.
☆55 Updated 4 years ago