bupticybee / ChineseChessMuzero
Playing Chinese chess (xiangqi) with the MuZero algorithm
☆9Updated 5 years ago
Alternatives and similar repositories for ChineseChessMuzero
Users that are interested in ChineseChessMuzero are comparing it to the libraries listed below
- ☆45Updated 2 years ago
- ☆21Updated 2 years ago
- ☆13Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆77Updated 6 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆83Updated 2 years ago
- Reinforcement Learning alphazero for Chinese chess☆8Updated 2 years ago
- A Doudizhu reinforcement learning AI☆36Updated last month
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆184Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆51Updated 9 months ago
- Scalable Implementation of Neural Fictitious Self-Play☆81Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 3 years ago
- ☆16Updated 3 years ago
- ☆12Updated 3 years ago
- StarCraft 2 Imitation Learning☆29Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆66Updated last year
- A C++ pytorch implementation of MuZero☆38Updated last year
- This project is implementation code of AlphaStar☆200Updated last year
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- PPO with multi-head/autoregressive action outputs☆39Updated 4 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- ☆34Updated 4 years ago
- Implementation code for Agent57 (reinforcement learning), written for a Qiita post.☆45Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆43Updated 3 years ago
- Example code for the Gym documentation☆72Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API☆173Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆50Updated last month
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆105Updated last year
- A doudizhu AI trained with reinforcement learning.☆16Updated 5 years ago