go5paopao / mahjong-selfplay-RLLinks
Deep reinforcement learning of mahjong self-play
☆17Updated 7 years ago
Alternatives and similar repositories for mahjong-selfplay-RL
Users that are interested in mahjong-selfplay-RL are comparing it to the libraries listed below
Sorting:
- ☆24Updated 3 years ago
- C++版日麻. Japanese Riichi Mahjong written in C++.☆120Updated 11 months ago
- varitional oracle guiding for reinforcement learning☆12Updated 3 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆84Updated 6 years ago
- ☆45Updated 3 years ago
- ☆13Updated 4 years ago
- Implementing reinforcement-learning algorithms for pysc2 -environment☆89Updated 8 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆216Updated 10 months ago
- Utility tools for tenhou.net log☆31Updated last year
- This project is implementation code of AlphaStar☆204Updated last year
- Bot for tenhou.net riichi mahjong server written in Python☆209Updated 2 years ago
- Using counter factual regret minimization to computer optimal ranges of hands for each decision☆51Updated 4 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆120Updated 2 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆138Updated 7 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and use deep reinforcement learning to play it.☆12Updated 3 years ago
- ☆147Updated last year
- TensorFlow 2.0 for Deep Reinforcement Learning.☆88Updated 2 years ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆167Updated 2 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated last year
- StarCraft II Reinforcement Learning with Pytorch - Mini Games☆24Updated 7 years ago
- 星际2 AI中文教程 StarCraft2 AI with python-sc2/pysc2 API☆237Updated 5 years ago
- Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game s…☆99Updated 3 years ago
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play …☆352Updated 3 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆35Updated 7 years ago
- Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"☆15Updated 8 years ago
- very easy implementation of dueling DQN in pytorch☆74Updated 3 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Updated 7 years ago
- C++/python fight the lord with pybind11 (强化学习AI斗地主), Accepted to AIIDE-2020☆163Updated 4 years ago
- Learning-based agent for Google Research Football (足球游戏智能体)☆123Updated 2 years ago