bupticybee / ChineseChessMuzero
Playing Chinese chess using the MuZero algorithm
☆9Updated 4 years ago
Alternatives and similar repositories for ChineseChessMuzero:
Users who are interested in ChineseChessMuzero are comparing it to the libraries listed below.
- ☆40Updated 2 years ago
- ☆20Updated 2 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆75Updated 6 years ago
- Example code for the Gym documentation☆71Updated last year
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- ☆12Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 7 months ago
- A Doudizhu reinforcement learning AI☆22Updated 3 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- This project is implementation code of AlphaStar☆198Updated last year
- A C++ pytorch implementation of MuZero☆36Updated 11 months ago
- An environment of the board game Go using OpenAI's Gym API☆174Updated 2 years ago
- Reinforcement Learning alphazero for Chinese chess☆7Updated last year
- 2D Overlooking Shooting Game☆9Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆30Updated 2 years ago
- Mahjong game code built on the RLCard platform, including rule-based and Dueling DQN-based agent models.☆30Updated 2 years ago
- An unofficial implementation of AlphaHoldem. 1v1 nl-holdem AI.☆83Updated last year
- StarCraft 2 Imitation Learning☆29Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆173Updated 10 months ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆73Updated 3 months ago
- ☆16Updated 3 years ago
- ☆13Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆39Updated 3 years ago
- ☆18Updated 3 years ago
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play☆25Updated 6 years ago
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆22Updated 6 months ago
- A modular implementation of Proximal Policy Optimization in Tensorflow 2 using Eager Execution for the Super Mario Bros environment.☆21Updated 5 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆51Updated 4 years ago
- Doudizhu AI trained using reinforcement learning.☆15Updated 5 years ago