pandezhao / alpha_sigma
A PyTorch-based Gomoku game model that combines AlphaZero-style reinforcement learning with Monte Carlo Tree Search.
☆164Updated 5 years ago
Alternatives and similar repositories for alpha_sigma:
Users who are interested in alpha_sigma are comparing it to the repositories listed below:
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku☆200Updated last week
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆74Updated 6 years ago
- ☆60Updated 6 years ago
- Chinese chess (Xiangqi) implemented with AlphaZero; an AlphaGo Zero / AlphaZero practice project.☆498Updated last year
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 7 years ago
- Dou Dizhu (doudizhu) AI trained with reinforcement learning.☆15Updated 5 years ago
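- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》☆256Updated 6 years ago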
- Learning-based agent for Google Research Football (football game agent)☆111Updated last year
- Chinese-language StarCraft II AI tutorial: StarCraft2 AI with python-sc2/pysc2 API☆230Updated 4 years ago
- ☆161Updated last year
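- Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, intended mainly as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS in making decisions, and how it learns through self-play. Source code plus tutorial.☆89Updated 2 years ago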
- Chinese documentation for Tianshou (天授)☆55Updated 3 months ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network.☆44Updated 4 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆17Updated 6 years ago
- Learning Resources And Links Of Reinforcement Learning (updating)☆251Updated 3 years ago
- C++/Python Fight the Landlord with pybind11 (reinforcement-learning Dou Dizhu AI), accepted to AIIDE-2020☆160Updated 3 years ago
- A Gomoku "artificial idiot" written with deep learning + reinforcement learning☆39Updated 7 years ago
- A student implementation of Alpha Go Zero☆280Updated 6 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated 5 months ago
- Implementation of benchmark RL algorithms☆466Updated 2 years ago
- ☆40Updated 2 years ago
- ☆19Updated 2 years ago
- Human-versus-computer Gomoku based on DQN☆58Updated 5 years ago
- ☆45Updated 5 years ago
- ☆305Updated 2 years ago
- Reinforcement learning: Chinese notes and resources, mainly Python examples, progressing from basic to advanced☆94Updated 4 years ago
- Implementation of Machine Learning Algorithms☆403Updated 6 years ago
- Build your own Chinese chess AI with the AlphaZero algorithm☆244Updated 2 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- Implementation code for AlphaStar☆197Updated last year