go5paopao / mahjong-selfplay-RLLinks
Deep reinforcement learning of mahjong self-play
☆17Updated 7 years ago
Alternatives and similar repositories for mahjong-selfplay-RL
Users that are interested in mahjong-selfplay-RL are comparing it to the libraries listed below
Sorting:
- ☆22Updated 3 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆83Updated 6 years ago
- varitional oracle guiding for reinforcement learning☆12Updated 3 years ago
- ☆59Updated 2 months ago
- 基于RLCard平台的麻将mahjong博弈游戏代码,包括基于规则和基于Dueling DQN的Agent模型。☆30Updated 3 years ago
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆209Updated 5 months ago
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Using counter factual regret minimization to computer optimal ranges of hands for each decision☆49Updated 4 years ago
- ☆45Updated 2 years ago
- ☆13Updated 3 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Updated 7 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Updated 6 years ago
- C++ implementations of Counterfactual Regret Minimization and Monte Carlo CFR☆75Updated 3 years ago
- StarCraft II Reinforcement Learning with Pytorch - Mini Games☆24Updated 7 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆15Updated 7 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- C++版日麻. Japanese Riichi Mahjong written in C++.☆116Updated 7 months ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Scalable Implementation of Deep CFR and Single Deep CFR☆303Updated 5 years ago
- Mahjong4RL is a project that recreates the game of Japanese Mahjong and use deep reinforcement learning to play it.☆12Updated 3 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆175Updated 6 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆127Updated 2 years ago
- Modular Multi-Objective Reinforcement Learning with Decision Values☆24Updated 2 years ago
- Codification used for the AAMAS-17 paper "Simultaneously Learning and Advising in Multiagent Reinforcement Learning"☆15Updated 7 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆32Updated 8 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 7 years ago
- PPO with multi-head/autoregressive action outputs☆42Updated 4 years ago
- RainBow, Tensorflow☆49Updated 7 years ago
- StarCraft II / PySC2 Deep Reinforcement Learning Agents (A2C)☆137Updated 6 years ago
- TensorFlow 2.0 for Deep Reinforcement Learning.☆87Updated last year