bupticybee / gym_chinese_chess
Chinese chess (Xiangqi) gym environment
☆14 Updated 4 years ago
Alternatives and similar repositories for gym_chinese_chess:
Users interested in gym_chinese_chess compare it to the repositories listed below
- A PyTorch-based Gomoku game model using AlphaZero-style reinforcement learning and Monte Carlo Tree Search. ☆166 Updated 6 years ago
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku ☆206 Updated 2 months ago
- A student implementation of AlphaGo Zero ☆280 Updated 6 years ago
- OpenAI Gym Env for the game Gomoku (Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...) ☆88 Updated 6 months ago
- Chinese chess implemented with AlphaZero. An AlphaGo Zero / AlphaZero practice project that implements Chinese chess. ☆508 Updated last year
- Implementation code for AlphaStar ☆200 Updated last year
- This repo sets up the environment to play Xiang Qi (Chinese chess) following the OpenAI Gym framework. ☆37 Updated 2 years ago
- ☆4 Updated 5 months ago
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆75 Updated 7 years ago
- Play Flappy Bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆67 Updated 8 years ago
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play … ☆331 Updated 2 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku ☆15 Updated last year
- C++/Python Fight the Lord with pybind11 (a reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020 ☆160 Updated 3 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole ☆65 Updated 7 years ago
- Learning-based agent for Google Research Football (football game agent) ☆111 Updated 2 years ago
- ☆16 Updated 3 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI. ☆57 Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API ☆172 Updated 3 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search ☆101 Updated 6 years ago
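- Chinese chess AlphaZero program ☆401 Updated 6 years ago
- Playing Chinese chess with the MuZero algorithm ☆9 Updated 4 years ago
- A Dou Dizhu AI with a multi-channel stacked-attention Transformer architecture, built on an isolating-language hypothesis and breadth-first search ☆93 Updated 4 years ago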
- ☆61 Updated 6 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient) ☆42 Updated 2 years ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II ☆265 Updated 2 weeks ago
- Random Network Distillation (RND) algorithm in PyTorch ☆49 Updated 6 years ago
- ☆143 Updated 5 months ago
- Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game. ☆32 Updated 5 years ago
- (TG'2021) Code for paper "Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning". TG = Transact… ☆10 Updated 2 years ago
- ☆165 Updated last year