gingkg / AlphaZero_Gomoku_PyTorch
A Gomoku AI trained with reinforcement learning (self-play + MCTS), built on PyTorch
☆24 Updated 3 years ago
Alternatives and similar repositories for AlphaZero_Gomoku_PyTorch:
Users that are interested in AlphaZero_Gomoku_PyTorch are comparing it to the libraries listed below
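- This project uses Monte Carlo tree search and a residual neural network to build a Gomoku AI that can reach fairly strong play after a short training run on small-scale hardware. Following the original AlphaGo Zero paper, "Mastering the game of Go without human knowledge", it implements a Gomoku… ☆38 Updated 2 years ago
- Meta-Zeta is a reinforcement-learning Gomoku (Gobang) model, intended mainly as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS decisions and how it learns through self-play. Source code + tutorial. ☆92 Updated 2 years ago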
- ☆62 Updated last year
- Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such a… ☆37 Updated 3 weeks ago
- Build your own Chinese chess AI with the AlphaZero algorithm ☆250 Updated 2 years ago
- A PyTorch-based Gomoku game model, built on AlphaZero-style reinforcement learning and Monte Carlo Tree Search. ☆164 Updated 5 years ago
- An easier PyTorch deep reinforcement learning library. ☆192 Updated 3 months ago
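- A simple, easy-to-use tool for creating and controlling 3D scenes. Built on ThreeJS, with a pure Python interface. Suited to 3D demos in research and multi-agent reinforcement learning, entertainment, and similar applications. ☆40 Updated last year
- Playing Super Mario with reinforcement learning ☆65 Updated 2 years ago
- Artificial intelligence course assignment project: Gobang game ☆46 Updated 4 years ago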
- ☆163 Updated last year
- A project that uses machine learning and deep learning methods to play Gobang by controlling a robotic arm ☆11 Updated last year
- Play the Atari Tennis game with DQN ☆72 Updated 2 years ago
- NeurIPS 2024 DACER ☆89 Updated last month
- A curated list of visual reinforcement learning resources ☆220 Updated last month
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆201 Updated 3 weeks ago
- This repository collects code that encapsulates commonly used algorithms in the field of machine learning. Most of them are based o… ☆495 Updated 3 weeks ago
- Robot maze navigation: PyTorch, reinforcement learning, DQN. ☆89 Updated 4 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆34 Updated 4 months ago
- Gym-based deep reinforcement learning (DRL) in PyTorch (PPO, PPG, DQN, SAC, DDPG, TD3, and other algorithms) ☆91 Updated last week
- ☆322 Updated 10 months ago
- Dou Dizhu AI trained with reinforcement learning. ☆15 Updated 5 years ago
- Gobang MCTS: a Gomoku AI algorithm implemented in C++ with Monte Carlo tree search (Tongji University) ☆10 Updated last year
- ♟♟♟♟♟ A Gomoku game AI based on Monte Carlo Tree Search; it can now be trained with a policy-value network. ☆44 Updated 4 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi… ☆114 Updated 2 months ago
- Implementation of the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2. ☆49 Updated 4 years ago
- ☆90 Updated 2 years ago
- ☆61 Updated 4 months ago
- ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (te… ☆166 Updated last year
- 2048 environment for Reinforcement Learning and DQN algorithm ☆40 Updated 2 years ago