gingkg / AlphaZero_Gomoku_PyTorchLinks
基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI
☆25Updated 4 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
Sorting:
- 本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬 件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文 《Mastering the game of Go without human knowledge》实现的一个在五子…☆45Updated 3 years ago
- AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆105Updated 3 months ago
- ☆387Updated last year
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆52Updated 5 years ago
- ☆215Updated 6 months ago
- 机器人走迷宫,Pytorch,强化学习,DQN。☆95Updated 4 years ago
- A curated list of visual reinforcement learning resources☆378Updated 2 months ago
- This repository collects some codes that encapsulates commonly used algorithms in the field of machine learning. Most of them are based o…☆571Updated 3 months ago
- rl-papers☆48Updated 2 years ago
- Honor of Kings AI Open Environment of Tencent☆770Updated last year
- An easier PyTorch deep reinforcement learning library.☆236Updated 8 months ago
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆53Updated last month
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te…☆175Updated 2 years ago
- NeurIPS 2024 DACER☆138Updated 3 weeks ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆165Updated 6 years ago
- ☆66Updated last year
- basic algorithms of reinforcement learning☆213Updated 2 years ago
- ☆90Updated 3 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆125Updated last month
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆157Updated 4 years ago
- ☆636Updated 2 years ago
- A minimal codebase for PPO training on MuJoCo environments with some customization supports.☆15Updated 3 years ago
- A collection of notes @SJTU-CSE, written by Yanjie Ze. 上海交通大学计算机系本科生复习笔记。在线浏览网站:https://zeyanjie.gitbook.io/yanjie-zes-note/☆21Updated 3 years ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆635Updated 4 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆117Updated 3 years ago
- 动手学强化学习代码☆60Updated last year
- 基于stablebaseline3强化学习框架和gym-super-mario-bros马里奥游戏包,训练马里奥通关。☆126Updated 2 months ago
- Robot Learning Algorithms☆26Updated last year
- ☆96Updated 2 years ago
- ☆48Updated 4 months ago