gingkg / AlphaZero_Gomoku_PyTorch
基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI
☆24Updated 3 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
Sorting:
- 本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬 件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文 《Mastering the game of Go without human knowledge》实现的一个在五子…☆42Updated 2 years ago
- Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆97Updated 2 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆15Updated 5 years ago
- 使用alphazero算法打造属于你自己的象棋AI☆262Updated 2 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆166Updated 6 years ago
- ☆349Updated last year
- 深度强化学习贪吃蛇游戏。拥有完整游戏环境与AI接口。(项目未完成)☆37Updated 5 years ago
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆51Updated 4 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆45Updated 5 years ago
- ☆65Updated last year
- 强化学习玩超级马里奥☆72Updated 3 years ago
- Python+PyQt5实现五子棋游戏(人机博弈+深搜+α-β剪枝)☆32Updated 3 years ago
- ☆40Updated last week
- OpenAI团队的深度强化学习教程中文版☆29Updated 5 years ago
- A curated list of visual reinforcement learning resources☆265Updated 2 weeks ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆122Updated 3 weeks ago
- ☆90Updated 2 years ago
- 2024年腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆21Updated 8 months ago
- ☆43Updated last year
- PPO, DDPG, SAC implementation on mujoco environment☆109Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆67Updated 11 months ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆105Updated last month
- rl-papers☆47Updated 2 years ago
- 人工智能大作业项目:五子棋游戏 Artificial intelligence assignment project: Gobang Game☆46Updated 4 years ago
- NeurIPS 2024 DACER☆106Updated last week
- Learn to play Sekiro with reinforcement learning.☆16Updated 2 years ago
- ☆45Updated 2 years ago
- ☆165Updated last year
- 动手学强化学习代码☆56Updated last year
- 用强化学习DQN算法,训练AI模型来玩合成大西瓜游戏,提供Keras版本和PARL(paddle)版本☆89Updated 4 years ago