bupticybee / gym_chinese_chess
中国象棋gym环境
☆13Updated 4 years ago
Alternatives and similar repositories for gym_chinese_chess:
Users that are interested in gym_chinese_chess are comparing it to the libraries listed below
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆67Updated 7 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆196Updated 5 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆159Updated 3 years ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆163Updated 5 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated 4 months ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated 2 years ago
- ☆40Updated 2 years ago
- A student implementation of Alpha Go Zero☆279Updated 6 years ago
- ☆3Updated 2 months ago
- ☆9Updated 2 years ago
- A gym game for Contra that for reinforcement learning☆10Updated 3 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- ☆33Updated 7 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆15Updated 5 years ago
- A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.☆56Updated 2 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆63Updated 7 years ago
- ☆20Updated 2 years ago
- Implementation of the AlphaZero algorithm for playing the simple board game Gomoku☆15Updated last year
- ☆60Updated 6 years ago
- 以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai☆92Updated 3 years ago
- This project is implementation code of AlphaStar☆195Updated last year
- This repo sets up the environment to play Xiang Qi (chinese chess) following the OpenAI Gym framework.☆34Updated 2 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make…☆17Updated 6 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆43Updated 2 years ago
- SuperMario A3C Trainer for windows☆33Updated 6 years ago
- This is a simple implementation of DeepMind's PySC2 RL agents.☆272Updated 7 years ago
- ☆32Updated 4 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆170Updated 9 months ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 6 years ago
- Application of proximal policy optimization algorithm to the card game Big 2 using Tensorflow☆76Updated last year