yuxiua / open_chess_zero
Reinforcement Learning alphazero for Chinese chess
☆7Updated last year
Alternatives and similar repositories for open_chess_zero:
Users that are interested in open_chess_zero are comparing it to the libraries listed below
- Douzero with ResNet and GPU support for Windows☆39Updated 3 years ago
- ☆40Updated 2 years ago
- ☆20Updated 2 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆15Updated 5 years ago
- A gobang AI with Negamax and alpha beta pluning☆13Updated 2 years ago
- 使用Muzero算法进行中国象棋对弈☆9Updated 4 years ago
- ☆12Updated 2 years ago
- A Doudizhu reinforcement learning AI☆23Updated 3 months ago
- An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.☆84Updated last year
- AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。☆502Updated last year
- ☆17Updated last year
- 中国象棋alpha zero程序☆397Updated 6 years ago
- 中国象棋gym环境☆14Updated 4 years ago
- Self-Labeling the Job Shop Scheduling Problem☆12Updated 9 months ago
- RLA is a tool for managing your RL experiments automatically☆71Updated 2 years ago
- ☆16Updated 3 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 2 years ago
- Deep reinforcement learning of mahjong self-play☆17Updated 6 years ago
- A Chinese Chess program and a AI based on Monte Carlo Tree Search and Neural Network(like AlphaGo)一个中国象棋程序和一个配套的基于蒙特卡洛算法及神经网络的人工智能(模仿阿尔法…☆110Updated 6 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆174Updated 10 months ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆202Updated last month
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Bomberman deep reinforcement learning challenge in PyTorch☆25Updated 6 years ago
- PPO with multi-head/autoregressive action outputs☆39Updated 4 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆76Updated 6 years ago
- ☆19Updated 9 months ago
- ☆17Updated 2 years ago
- Official repository for the TMLR paper "Self-Improvement for Neural Combinatorial Optimization: Sample Without Replacement, but Improveme…☆25Updated 2 weeks ago