pandezhao / alpha_sigmaLinks

A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.

☆165

Alternatives and similar repositories for alpha_sigma

Users that are interested in alpha_sigma are comparing it to the libraries listed below

Sorting:

initial-h / AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
☆209Updated 5 months ago
zhijs / -Reinforcement-Learning-five-in-a-row-
基于DQN的五子棋人机对弈
☆59Updated 6 years ago
ClausewitzCPU0 / SC2AI
星际2 AI中文教程 StarCraft2 AI with python-sc2/pysc2 API
☆233Updated 4 years ago
zhongjn / gomokuer
A tiny re-implementation of AlphaGo Zero (in Gomoku)
☆76Updated 7 years ago
chengstone / cchess-zero
AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目，实现中国象棋。
☆517Updated last year
zouyih / AlphaZero_Gomoku-tensorflow
☆62Updated 6 years ago
zhangchuheng123 / Reinforcement-Implementation
Implementation of benchmark RL algorithms
☆467Updated 3 years ago
zhuliquan / reinforcement_learning_basic_book
这是一个学习强化学习基础原理的仓库，主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码
☆263Updated 6 years ago
tinyzqh / awesome-reinforcement-learning
Learning Resources And Links Of Reinforcement Learning （updating）
☆274Updated 3 years ago
jidiai / ai_lib
☆167Updated last year
xmfbit / DQN-FlappyBird
Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch
☆69Updated 8 years ago
tobegit3hub / ml_implementation
Implementation of Machine Learning Algorithms
☆408Updated 6 years ago
gxnk / reinforcement-learning-code
☆391Updated 4 years ago
qiwihui / reinforcement-learning-an-introduction-chinese
《Reinforcement Learning: An Introduction》（第二版）中文翻译
☆569Updated 3 years ago
YangRui2015 / 2048_env
2048 environment for Reinforcement Learning and DQN algorithm
☆40Updated 3 years ago
qqiang00 / Reinforce
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
☆851Updated 5 years ago
rl-cn / rl-cn
A translation of Reinforcement Learning: An Introduction
☆114Updated 6 years ago
dylandjian / SuperGo
A student implementation of Alpha Go Zero
☆280Updated 7 years ago
int8 / monte-carlo-tree-search
Monte carlo tree search in python
☆609Updated 3 years ago
YoujiaZhang / AlphaGo-Zero-Gobang
AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型，主要用以了解AlphaGo Zero的运行原理的Demo，即神经网络是如何指导MCTS做出决策的，以及如何自我对弈学习。源码+教程
☆104Updated 2 months ago
GAOYANGAU / DRLPytorch
Pytorch for Deep Reinforcement Learning
☆250Updated 5 years ago
thu-ml / tianshou-docs-zh_CN
天授中文文档
☆58Updated 7 months ago
johnjim0816 / rl-tutorials
basic algorithms of reinforcement learning
☆211Updated last year
ChengTsang / PPO-clip-and-PPO-penalty-on-Atari-Domain
Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Updated 6 years ago
finint / RL-Solutions
强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition
☆157Updated 4 years ago
haroldsultan / MCTS
Python Implementations of Monte Carlo Tree Search
☆310Updated 3 years ago
buyulian / Five-Chess-DQN
用深度学习+强化学习编写的一个五子棋人工智障
☆42Updated 7 years ago
TARTRL / TiKick
Learning-based agent for Google Research Football (足球游戏智能体)
☆121Updated 2 years ago
Teacher-Guo / RL_code
RL-code for beginners. Enjoying!
☆115Updated 5 years ago
2019ChenGong / RL-Paper-notes
☆313Updated 2 years ago