pandezhao / alpha_sigmaLinks
A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.
☆165Updated 6 years ago
Alternatives and similar repositories for alpha_sigma
Users that are interested in alpha_sigma are comparing it to the libraries listed below
Sorting:
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆211Updated 8 months ago
- ☆62Updated 6 years ago
- AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。☆520Updated 2 years ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆76Updated 7 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆71Updated 8 years ago
- 基于DQN的五子棋人机对弈☆60Updated 6 years ago
- 星际2 AI中文教程 StarCraft2 AI with python-sc2/pysc2 API☆236Updated 4 years ago
- ☆391Updated 5 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆266Updated 6 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆618Updated 3 years ago
- Implementation of benchmark RL algorithms☆471Updated 3 years ago
- 天授中文文档☆61Updated 11 months ago
- 2048 environment for Reinforcement Learning and DQN algorithm☆40Updated 3 years ago
- 我的强化学习笔记和学习材料 still updating ... ...☆361Updated last month
- Learning Resources And Links Of Reinforcement Learning (updating)☆284Updated 4 years ago
- A translation of Reinforcement Learning: An Introduction☆114Updated 7 years ago
- Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments☆855Updated 5 years ago
- ☆171Updated 2 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆121Updated 2 years ago
- AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆109Updated 5 months ago
- 斯坦福 cs234 强化学习中文讲义☆207Updated 4 years ago
- A student implementation of Alpha Go Zero☆282Updated 7 years ago
- Monte carlo tree search in python☆619Updated 3 years ago
- Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.☆297Updated last year
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- Source codes for the book "Reinforcement Learning: Theory and Python Implementation"☆997Updated 3 weeks ago
- A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain☆235Updated last year
- RL-code for beginners. Enjoying!☆115Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆163Updated 4 years ago
- Implementation of Machine Learning Algorithms☆407Updated 6 years ago