yata0 / Mahjong
☆9Updated 3 years ago
Alternatives and similar repositories for Mahjong:
Users who are interested in Mahjong are comparing it to the libraries listed below
- ☆20Updated 2 years ago
- Variational oracle guiding for reinforcement learning☆11Updated 3 years ago
- ☆40Updated 2 years ago
- C++/Python Fight the Landlord with pybind11 (reinforcement learning AI for DouDizhu), accepted to AIIDE-2020☆160Updated 3 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆75Updated 6 years ago
- advantage actor-critic reinforcement learning for openai gym cartpole☆65Updated 7 years ago
- Python Fan calculator for Chinese Standard Mahjong☆19Updated 2 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆116Updated 3 years ago
- ☆142Updated 3 months ago
- PKU course, Reinforcement Learning, final project☆22Updated 4 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆129Updated last year
- An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku☆202Updated last month
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).☆62Updated 6 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 6 months ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆17Updated 11 months ago
- Chinese Standard Mahjong Competition hosted by AILab in Peking University.☆99Updated 2 years ago
- Random Network Distillation pytorch☆247Updated 6 years ago
- [NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation☆173Updated 10 months ago
- Keeping track of RL experiments☆162Updated 2 years ago
- ☆3Updated 3 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆181Updated 7 years ago
- This project is an implementation of AlphaStar☆198Updated last year
- ☆12Updated 2 years ago
- ☆32Updated 4 years ago
- Learning-based agent for Google Research Football (football game agent)☆111Updated last year
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Actor-critic with experience replay☆252Updated 2 years ago
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play …☆329Updated 2 years ago
- ☆48Updated last year