bupticybee / gym_chinese_chess
Chinese chess (Xiangqi) Gym environment
☆11 Updated 4 years ago
Related projects:
- A PyTorch-based Gomoku game model: reinforcement learning and Monte Carlo Tree Search based on the AlphaZero algorithm. ☆160 Updated 5 years ago
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku ☆185 Updated 4 years ago
- ☆36 Updated last year
- Implementation code for AlphaStar ☆186 Updated 8 months ago
- Learning-based agent for Google Research Football (football game AI agent) ☆106 Updated last year
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation ☆151 Updated 4 months ago
- A student implementation of AlphaGo Zero ☆276 Updated 6 years ago
- ☆21 Updated 2 years ago
- ☆24 Updated 2 years ago
- Chinese-language StarCraft II AI tutorial: StarCraft2 AI with python-sc2/pysc2 API ☆225 Updated 3 years ago
- An illustration program which visualizes the MCTS mechanism inside AlphaZero in order to provide a better understanding of how an AI make… ☆16 Updated 6 years ago
- Implementation of PPO-clip and PPO-penalty on Atari, described by its author as the only open-source implementation of PPO-penalty ☆56 Updated 5 years ago
- ☆135 Updated 3 years ago
- ☆55 Updated this week
- A tiny re-implementation of AlphaGo Zero (in Gomoku) ☆69 Updated 6 years ago
- ☆33 Updated 6 years ago
- Chinese chess implemented with AlphaZero. An AlphaGo Zero / AlphaZero practice project that implements Chinese chess. ☆472 Updated 10 months ago
- This repo sets up the environment to play Xiang Qi (Chinese chess) following the OpenAI Gym framework. ☆32 Updated last year
- ☆31 Updated 4 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch ☆68 Updated 7 years ago
- This is a simple implementation of DeepMind's PySC2 RL agents. ☆271 Updated 6 years ago
- ☆10 Updated 6 years ago
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play … ☆307 Updated last year
- C++/Python Fight the Landlord (DouDizhu) with pybind11 (reinforcement learning AI for DouDizhu), accepted to AIIDE-2020 ☆158 Updated 3 years ago
- Honor of Kings AI Open Environment of Tencent ☆616 Updated 2 months ago
- ☆79 Updated 2 months ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,… ☆36 Updated 5 years ago
- OpenAI Gym environment for the game Gomoku (Five-in-a-Row, 五子棋, 五目並べ, omok, Gobang, ...) ☆85 Updated 3 weeks ago
- Chinese chess AlphaZero program ☆373 Updated 5 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆48 Updated 5 years ago