zhijs / -Reinforcement-Learning-five-in-a-row-
基于DQN的五子棋人机对弈
☆55Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for -Reinforcement-Learning-five-in-a-row-
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆161Updated 5 years ago
- alphaGo版本的五子棋(gobang, gomoku)☆68Updated 4 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆187Updated 4 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆68Updated 7 years ago
- AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。☆479Updated 11 months ago
- ☆22Updated 6 years ago
- 我的强化学习笔记和学习材料 still updating ... ...☆336Updated 5 years ago
- 用深度学习+强化学习编写的一个五子棋人工智障☆36Updated 6 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆69Updated 6 years ago
- 使用alphazero算法打造属于你自己的象棋AI☆212Updated 2 years ago
- 用强化学习DQN算法,训练AI模型来玩合成大西瓜游戏,提供Keras版本和PARL(paddle)版本☆87Updated 3 years ago
- ☆59Updated 5 years ago
- rl on super-mario-bros☆50Updated 3 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆229Updated 3 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆85Updated 3 years ago
- 星际2 AI中文教程 StarCraft2 AI with python-sc2/pysc2 API☆227Updated 3 years ago
- 机巧围棋(CleverGo)基于Python+Pygame+PaddlePaddle打造一款点击按钮就能可视化地训练围棋人工智能的程序。☆65Updated 2 years ago
- Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆76Updated last year
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子。☆47Updated 5 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆253Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆157Updated 3 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆440Updated 2 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆41Updated 4 years ago
- ☆384Updated 4 years ago
- A Chinese Chess program and a AI based on Monte Carlo Tree Search and Neural Network(like AlphaGo)一个中国象棋程序和一个配套的基于蒙特卡洛算法及神经网络的人工智能(模仿阿尔法…☆105Updated 5 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆85Updated 3 weeks ago
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆131Updated 3 years ago
- ☆158Updated last year
- 基于博弈树α-β剪枝搜索的五子棋AI☆697Updated 7 years ago
- A student implementation of Alpha Go Zero☆279Updated 6 years ago