go5paopao / mahjong-selfplay-RL
Deep reinforcement learning of mahjong self-play
☆17Updated 6 years ago
Alternatives and similar repositories for mahjong-selfplay-RL:
Users that are interested in mahjong-selfplay-RL are comparing it to the libraries listed below
- Utility tools for tenhou.net log☆28Updated last year
- varitional oracle guiding for reinforcement learning☆11Updated 2 years ago
- ☆20Updated 2 years ago
- C++版日麻. Japanese Riichi Mahjong written in C++.☆107Updated last month
- ☆40Updated 2 years ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆29Updated 2 years ago
- Reinforcement learning (RL) implementation of imperfect information game Mahjong using markov decision processes to predict future game s…☆80Updated 2 years ago
- Japanese Mahjong AI.☆37Updated 9 years ago
- Mahjong game simulator for RiichiLab https://mjai.app☆68Updated this week
- Scalable Implementation of Neural Fictitous Self-Play☆75Updated 6 years ago
- Scripts for downloading logs from tenhou.net☆56Updated 8 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Mjx: A framework for Mahjong AI research☆175Updated 10 months ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆169Updated 6 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- Game server for Japanese Mahjong AI.☆52Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Updated 6 years ago
- ☆13Updated 2 years ago
- ☆9Updated 2 years ago
- C++ implementations of Counterfactual Regret Minimization and Monte Carlo CFR☆72Updated 2 years ago
- Bot for tenhou.net riichi mahjong server written in Python☆200Updated last year
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆136Updated 6 years ago
- Using counter factual regret minimization to computer optimal ranges of hands for each decision☆48Updated 4 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- ☆12Updated 3 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated 2 years ago
- Open AI gym environment for the game 2048☆71Updated 2 years ago