gingkg / AlphaZero_Gomoku_PyTorchLinks
基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI
☆24Updated 4 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
Sorting:
- 本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬 件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文 《Mastering the game of Go without human knowledge》实现的一个在五子…☆44Updated 2 years ago
- AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程☆100Updated 3 weeks ago
- ☆361Updated last year
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆165Updated 6 years ago
- An easier PyTorch deep reinforcement learning library.☆226Updated 6 months ago
- ☆66Updated last year
- 机器人走迷宫,Pytorch,强化学习,DQN。☆95Updated 4 years ago
- ☆48Updated last month
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆152Updated 4 years ago
- Play atari Tennis game by dqn☆75Updated 3 years ago
- NeurIPS 2024 DACER☆123Updated 3 weeks ago
- 人工智能大作业项目:五子棋游戏 Artificial intelligence assignment project: Gobang Game☆48Updated 4 years ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆130Updated last week
- basic algorithms of reinforcement learning☆212Updated last year
- 基于DQN的五子棋人机对弈☆59Updated 6 years ago
- 基于stablebaseline3强化学习框架和gym-super-mario-bros马里奥游戏包,训练马里奥通关。☆104Updated last week
- 强化学习玩超级马里奥☆75Updated 3 years ago
- ☆165Updated last year
- D3QN 强化学习打只狼☆28Updated 3 years ago
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search, can be trained on policy-value network now. 一个蒙特卡洛树搜索算法实现的五子棋 AI,现可用神经网络训练模型。☆47Updated 5 years ago
- Use seaborn to draw RL picture☆26Updated 2 years ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆596Updated 2 months ago
- Python+PyQt5实现五子棋游戏(人机博弈+深搜+α-β剪枝)☆32Updated 3 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆110Updated 3 months ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆16Updated 5 years ago
- 动手学强化学习代码☆58Updated last year
- lecture32_AI挑战星际争霸II(强化学习)☆17Updated 2 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆155Updated 11 months ago
- A curated list of visual reinforcement learning resources☆298Updated last month
- 南京大学人工智能学院本科生开放日面试经验分享☆28Updated last month