yuxiua / open_chess_zero
Reinforcement Learning alphazero for Chinese chess
☆7Updated 2 years ago
Alternatives and similar repositories for open_chess_zero:
Users that are interested in open_chess_zero are comparing it to the libraries listed below
- 使用Muzero算法进行中国象棋对弈☆9Updated 4 years ago
- ☆21Updated 2 years ago
- ☆12Updated 2 years ago
- Douzero with ResNet and GPU support for Windows☆41Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Updated 6 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆206Updated 2 months ago
- A pytorch based Gomoku game model. Alpha Zero algorithm based reinforcement Learning and Monte Carlo Tree Search model.☆166Updated 6 years ago
- 中国象棋alpha zero程序☆401Updated 6 years ago
- ☆16Updated 3 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆15Updated 5 years ago
- cchess是一个Python版的中国象棋库☆53Updated 2 months ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- ☆45Updated 2 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆18Updated last year
- ☆17Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆50Updated 8 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 3 years ago
- 学习强化学习过程中的笔记和代码☆10Updated 4 years ago
- Deep reinforcement learning of mahjong self-play☆17Updated 6 years ago
- ☆13Updated 3 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆180Updated 11 months ago
- A Deep Reinforcment Learning Aproach to Texas Holdem☆34Updated 3 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- ☆18Updated 3 years ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆87Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆12Updated last year
- A tiny re-implementation of AlphaGo Zero (in Gomoku)☆75Updated 7 years ago
- A gobang AI with Negamax and alpha beta pluning☆13Updated 2 years ago