zhijs / -Reinforcement-Learning-five-in-a-row-
DQN-based human-vs-computer Gomoku (five-in-a-row)
☆59Updated 6 years ago
Alternatives and similar repositories for -Reinforcement-Learning-five-in-a-row-
Users that are interested in -Reinforcement-Learning-five-in-a-row- are comparing it to the libraries listed below
- A PyTorch-based Gomoku game model using AlphaZero-style reinforcement learning and Monte Carlo Tree Search.☆165Updated 6 years ago
- An AlphaGo-style version of Gomoku (gobang, gomoku)☆68Updated 5 years ago
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku☆210Updated 3 months ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆75Updated 7 years ago
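- A tongue-in-cheek Gomoku "artificial idiot" written with deep learning + reinforcement learning☆41Updated 7 years ago
- Gomoku based on reinforcement learning☆11Updated 6 years ago
- Training a Gomoku AI with reinforcement learning (self-play + MCTS), based on PyTorch☆24Updated 3 years ago
- A Gomoku-playing AI built with game-tree search and artificial neural networks. 171129: a new version based on RL training is planned, expected to be finished in January 2018☆116Updated 7 years ago
- AlphaGo-Zero-Gobang is a reinforcement-learning-based Gomoku (Gobang) model, mainly a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code + tutorial☆99Updated last week
- Reinforcement learning: Chinese-language notes & resources, mainly Python examples, from basic to advanced☆103Updated 4 years ago
- Automatically playing the mini-game amazing brick with depth-first search (DFS) and deep reinforcement learning (DRL), respectively☆50Updated 10 months ago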
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆69Updated 8 years ago
- ☆62Updated 6 years ago
- A Gomoku AI based on game-tree search with α-β pruning☆748Updated 7 years ago
- Dou Dizhu (doudizhu) AI trained with reinforcement learning.☆15Updated 5 years ago
- rl on super-mario-bros☆53Updated 4 years ago
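- Chinese lecture notes for Stanford CS234 (Reinforcement Learning)☆201Updated 4 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network.☆47Updated 5 years ago
- My reinforcement learning notes and study materials; still updating ... ...☆347Updated 5 years ago
- A Python N-in-Row game based on Monte Carlo Tree Search and UCT RAVE☆51Updated 7 years ago
- Trains an AI model with the DQN reinforcement learning algorithm to play the watermelon-merging game 合成大西瓜; Keras and PARL (paddle) versions are provided☆90Updated 4 years ago
- A repository for learning the basic principles of reinforcement learning, mainly containing code for some examples and exercises from the book 《深入浅出强化学习原理入门》☆262Updated 6 years ago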
- SuperMario A3C Trainer for windows☆35Updated 6 years ago
- Implementation of Machine Learning Algorithms☆407Updated 6 years ago
- Chinese documentation for Tianshou (天授)☆58Updated 5 months ago
- 2048 environment for Reinforcement Learning and DQN algorithm☆41Updated 3 years ago
- ☆22Updated 7 years ago
- Adapted from the article https://blog.csdn.net/yellow_red_people/article/details/80465510: plays Flappy Bird with a DQN model on PyTorch, as an experimental example of reinforcement learning.☆47Updated 6 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆89Updated 7 months ago
- ☆166Updated last year