gingkg / AlphaZero_Gomoku_PyTorchLinks
基于Pytorch, 使用强化学习(自博弈+MCTS)训练一个五子棋AI
☆28Updated 4 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
Sorting:
- 本项目主要是采用蒙特卡洛搜索树与残差神经网络实现的一个可在小规模硬 件设施上短期训练一个拥有较强棋力的五子棋 AI。参考 AlphaGo Zero 原始论文 《Mastering the game of Go without human knowledge》实现的一个在 五子…☆49Updated 3 years ago
- 机器人走迷宫,Pytorch,强化学习,DQN。☆100Updated 4 years ago
- 腾讯开悟智能体比赛(王者荣耀AI比赛,稳定版)☆71Updated 2 months ago
- A curated list of visual reinforcement learning resources☆464Updated 2 months ago
- ☆421Updated last year
- ☆268Updated last month
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆168Updated 4 years ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆144Updated 2 weeks ago
- ☆798Updated 2 years ago
- ☆48Updated 9 months ago
- 《动手学强化学习》练习代码(Pytorch)☆18Updated 3 years ago
- ☆599Updated last year
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆56Updated 5 years ago
- Honor of Kings AI Open Environment of Tencent☆798Updated last year
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆197Updated 3 months ago
- PPO, DDPG, SAC implementation on mujoco environment☆125Updated 3 years ago
- This repository collects some codes that encapsulates commonly used algorithms in the field of machine learning. Most of them are based o…☆617Updated 8 months ago
- The mirror of RL_Coding_Exercise.☆115Updated last year
- 动手学强化学习代码☆66Updated 2 years ago
- basic algorithms of reinforcement learning☆216Updated 2 years ago
- ☆110Updated 2 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆148Updated 9 months ago
- rl-papers☆50Updated 2 years ago
- 一个简洁易用3D场景创建和控制工具。基于ThreeJS。纯Python接口。它适用于科研、多智能体强化学习领域的3D演示、娱乐等应用。☆48Updated 2 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆165Updated 6 years ago
- ☆106Updated 6 months ago
- Training a humanoid robot for locomotion using Reinforcement Learning☆1,045Updated 2 weeks ago
- 人工智能大作业项目:五子棋游戏 Artificial intelligence assignment project: Gobang Game☆55Updated 4 years ago
- Reinforcement learning with PyTorch, inspired by MorvanZhou, change the framework from Tensorflow to PyTorch☆315Updated 6 years ago
- 基于stablebaseline3强化学习框架和gym-super-mario-bros马里奥游戏包,训练马里奥通关。☆183Updated last month