bupticybee / ChineseChessMuzero
使用Muzero算法进行中国象棋对弈
☆9Updated 4 years ago
Alternatives and similar repositories for ChineseChessMuzero
Users that are interested in ChineseChessMuzero are comparing it to the libraries listed below
Sorting:
- ☆45Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆76Updated 5 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- ☆21Updated 2 years ago
- A Doudizhu reinforcement learning AI☆29Updated 4 months ago
- ☆13Updated 2 years ago
- A C++ pytorch implementation of MuZero☆38Updated last year
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆129Updated 6 months ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆206Updated 2 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 8 months ago
- An environment of the board game Go using OpenAI's Gym API☆172Updated 3 years ago
- This project is implementation code of AlphaStar☆200Updated last year
- MiniZero: An AlphaZero and MuZero Training Framework☆92Updated 2 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- StarCraft 2 Imitation Learning☆29Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- ☆32Updated 4 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆47Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆41Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- ☆143Updated 5 months ago
- ☆16Updated 3 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- ☆28Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated last year
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆212Updated 2 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 3 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆112Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 4 years ago