gingkg / AlphaZero_Gomoku_PyTorchLinks

基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI

☆25

Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch

Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below

Sorting:

YoujiaZhang / AlphaGo-Zero-Gobang
AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型，主要用以了解AlphaGo Zero的运行原理的Demo，即神经网络是如何指导MCTS做出决策的，以及如何自我对弈学习。源码+教程
☆103Updated last month
yfismine / AlphaZeroGomoku
本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文《Mastering the game of Go without human knowledge》实现的一个在五子…
☆44Updated 3 years ago
TobiasLv / RAD
☆52Updated last month
Starlight0798 / gymRL
基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)
☆113Updated this week
qingshi9974 / PPO-pytorch-Mujoco
Implement PPO algorithm on mujoco environment，such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.
☆53Updated 5 years ago
happy-yan / DACER-Diffusion-with-Online-RL
NeurIPS 2024 DACER
☆124Updated last month
qiwihui / spinningup
OpenAI团队的深度强化学习教程中文版
☆30Updated 5 years ago
leerumor / rl-atari-tennis
Play atari Tennis game by dqn
☆76Updated 3 years ago
johnjim0816 / rl-tutorials
basic algorithms of reinforcement learning
☆212Updated last year
ziwenhahaha / Code-of-RL-Beginning
☆193Updated 5 months ago
datawhalechina / rl-papers
rl-papers
☆47Updated 2 years ago
Zhendong-Wang / Diffusion-Policies-for-Offline-RL
☆372Updated last year
finint / RL-Solutions
强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition
☆156Updated 4 years ago
jidiai / ai_lib
☆166Updated last year
tencent-ailab / marl-mini
☆48Updated 2 months ago
BIT-aerial-robotics / AquaML
☆104Updated 5 months ago
Ericonaldo / ILSwiss
ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…
☆173Updated last year
YangRui2015 / 2048_env
2048 environment for Reinforcement Learning and DQN algorithm
☆40Updated 3 years ago
NoneJou072 / rl-notebook
深度强化学习各算法介绍与Pytorch实现
☆62Updated last year
jidiai / SummerCourse2022
☆90Updated 2 years ago
kaixindelele / RHER
The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…
☆155Updated last year
acezsq / dsx-rl
动手学强化学习代码
☆58Updated last year
johnjim0816 / joyrl-offline
☆66Updated last year
Baichenjia / UTDS
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline RL
☆17Updated last year
Robin-WZQ / Gobang-Chess
This is a project based on machine learning and deep learning method for playing Gobang by controlling mechanical arm（利用机械臂下五子棋）
☆12Updated 2 years ago
datawhalechina / joyrl
An easier PyTorch deep reinforcement learning library.
☆228Updated 6 months ago
MorvanZhou / RLarm
☆16Updated 4 years ago
qiwang067 / awesome-visual-rl
A curated list of visual reinforcement learning resources
☆319Updated 3 weeks ago
YanjieZe / SJTU_Course_Notes
A collection of notes @SJTU-CSE, written by Yanjie Ze. 上海交通大学计算机系本科生复习笔记。在线浏览网站：https://zeyanjie.gitbook.io/yanjie-zes-note/
☆21Updated 3 years ago
zbzhu99 / madiff
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
☆79Updated 3 weeks ago