buyulian / Five-Chess-DQNLinks
用深度学习+强化学习编写的一个五子棋人工智障
☆43Updated 7 years ago
Alternatives and similar repositories for Five-Chess-DQN
Users that are interested in Five-Chess-DQN are comparing it to the libraries listed below
Sorting:
- 基于DQN的五子棋人机对弈☆60Updated 6 years ago
- 强化学习-中文笔记&资源-以python实例为主-由浅入深☆109Updated 5 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆165Updated 6 years ago
- reinforcement learning☆51Updated 5 years ago
- 强化学习☆67Updated 6 years ago
- ☆29Updated 7 years ago
- ☆20Updated 7 years ago
- 我的强化学习笔记和学习材料 still updating ... ...☆363Updated 2 months ago
- 基于强化学习的五子棋☆11Updated 6 years ago
- playing Atari game with Deep Q Learning (DQN & DDQN) in tensorflow☆14Updated 7 years ago
- ☆21Updated 7 years ago
- Using deep reinforcement learning to play Snake game(贪吃蛇).☆82Updated 4 years ago
- 国立台湾大学李宏毅老师讲解的深度强化学习学习笔记☆150Updated 6 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆199Updated 6 years ago
- 应用博弈树搜索,人工神经网络实现五子棋博弈AI。171129:计划更新基于RL训练的新版本,预计18年1月完成☆120Updated 7 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆266Updated 7 years ago
- [动手学强化学习]系列,基于pytorch。☆59Updated 4 years ago
- 用强化学习DQN算法,训练AI模型来玩合成大西瓜游戏,提供Keras版本和PARL(paddle)版本☆95Updated 4 years ago
- 强化学习经典算法(offline\online learning, q-learning, DQN)的实 现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)☆32Updated 7 years ago
- 用 深度优先搜索 DFS 与 深度强化学习 DRL 分别自动控制 amazing brick 小游戏☆52Updated last year
- 强化学习常见算法的实现,Q-Learning/DQN/PG/AC/DDPG/PPO/SAC☆25Updated 3 years ago
- AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。☆519Updated 2 years ago
- A Chinese Chess program and a AI based on Monte Carlo Tree Search and Neural Network(like AlphaGo)一个中国象棋程序和一个配套的基于蒙特卡洛算法及神经网络的人工智能(模仿阿尔法…☆121Updated 7 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆50Updated 5 years ago
- ☆45Updated 6 years ago
- 这是中国研究生数学建模大赛的C题,用于解决航班恢复的问题。程序首先建立了飞机,航班,客户,机场四个类用于模拟航班调度环境。之后应用遗传算法寻找最优的航班调度方案☆33Updated 7 years ago
- ☆62Updated 6 years ago
- 通过python3.6编程,利用DQN算法实现机器学习避开障碍走到迷宫终点。(Through python3.6 programming, I use DQN algorithm to achieve machine learning and avoid obstacles…☆10Updated 7 years ago
- 人工智能导论课程设计-用强化学习玩FlappyBird☆18Updated 5 years ago
- AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆109Updated 6 months ago