bupticybee / ChineseChessMuzero
使用Muzero算法进行中国象棋对弈
☆9Updated 4 years ago
Alternatives and similar repositories for ChineseChessMuzero:
Users that are interested in ChineseChessMuzero are comparing it to the libraries listed below
- ☆44Updated 2 years ago
- ☆20Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆73Updated 4 months ago
- ☆67Updated 3 years ago
- This project is implementation code of AlphaStar☆199Updated last year
- A Doudizhu reinforcement learning AI☆26Updated 3 months ago
- Douzero with ResNet and GPU support for Windows☆41Updated 3 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆31Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- A C++ pytorch implementation of MuZero☆37Updated 11 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- ☆13Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆51Updated 7 months ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- Example code for the Gym documentation☆71Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆31Updated 2 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆25Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆27Updated 2 years ago
- Deep Reinforcement Learning for Multiplayer Online Battle Arena☆80Updated last year
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆125Updated 6 months ago
- An environment of the board game Go using OpenAI's Gym API☆173Updated 2 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆77Updated 6 years ago
- StarCraft 2 Imitation Learning☆29Updated 3 years ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆87Updated last year
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆28Updated 2 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆177Updated 11 months ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆15Updated 5 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆44Updated 2 years ago
- ☆143Updated 4 months ago
- ☆32Updated 4 years ago