go5paopao / mahjong-selfplay-RL
Deep reinforcement learning of mahjong self-play
☆17Updated 6 years ago
Alternatives and similar repositories for mahjong-selfplay-RL
Users that are interested in mahjong-selfplay-RL are comparing it to the libraries listed below
- ☆21Updated 2 years ago
- Japanese Riichi Mahjong written in C++.☆114Updated 5 months ago
- Reinforcement learning (RL) implementation of the imperfect-information game Mahjong using Markov decision processes to predict future game s…☆90Updated 2 years ago
- Variational oracle guiding for reinforcement learning☆12Updated 3 years ago
- Deep reinforcement learning with tensorflow2☆93Updated last month
- Scripts to download phoenix logs from tenhou.net☆40Updated last year
- Mahjong game code built on the RLCard platform, including rule-based and Dueling DQN-based agent models.☆30Updated 3 years ago
- Utility tools for tenhou.net log☆28Updated last year
- Scalable Implementation of Neural Fictitious Self-Play☆81Updated 6 years ago
- ☆45Updated 2 years ago
- ☆11Updated 3 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and uses deep reinforcement learning to play it.☆12Updated 3 years ago
- Mjx: A framework for Mahjong AI research☆180Updated last year
- Japanese Mahjong AI.☆38Updated 9 years ago
- ☆13Updated 3 years ago
- Riichi Mahjong Kit: (1) Game log crawler (sqlite3, json, bs4); (2) Game log preprocessor; (3) Deterministic algorithms library☆51Updated 6 years ago
- Mahjong game simulator for RiichiLab https://mjai.app☆85Updated last week
- ☆144Updated 6 months ago
- Implementation code for Agent57 (reinforcement learning), written for a Qiita post.☆45Updated 2 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆178Updated 11 months ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆137Updated 6 years ago
- OpenAI Gym Env for game Gomoku(Five-In-a-Row, 五子棋, 五目並べ, omok, Gobang,...)☆88Updated 8 months ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 5 years ago
- Implements PPO-clip and PPO-penalty on Atari; described as the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own…☆294Updated 4 months ago
- An AI for 3-player Mahjong (Sanma) using deep reinforcement learning☆37Updated 11 months ago
- Using counterfactual regret minimization to compute optimal ranges of hands for each decision☆49Updated 4 years ago
- C++ implementations of Counterfactual Regret Minimization and Monte Carlo CFR☆74Updated 3 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆155Updated 2 years ago