yata0 / Mahjong
☆10 Updated 3 years ago
Alternatives and similar repositories for Mahjong:
Users who are interested in Mahjong are comparing it to the repositories listed below
- ☆20 Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆51 Updated 7 months ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral) ☆17 Updated last year
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play ☆25 Updated 6 years ago
- ☆12 Updated 2 years ago
- Variational oracle guiding for reinforcement learning ☆11 Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆38 Updated 3 years ago
- This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO). ☆62 Updated 6 years ago
- Advantage actor-critic reinforcement learning for OpenAI Gym CartPole ☆65 Updated 7 years ago
- Learning-based agent for Google Research Football (football game agent) ☆111 Updated 2 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,… ☆34 Updated 6 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆77 Updated 6 years ago
- Chinese Standard Mahjong Competition hosted by AILab in Peking University. ☆104 Updated 3 years ago
- ☆44 Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU) ☆20 Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆57 Updated 3 years ago
- ☆32 Updated 4 years ago
- Counterfactual regret minimization algorithm for Kuhn poker ☆171 Updated 6 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le… ☆133 Updated 7 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function" ☆131 Updated last year
- Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039) ☆159 Updated 2 years ago
- Random Network Distillation (RND) algorithm in PyTorch ☆49 Updated 6 years ago
- Keeping track of RL experiments ☆162 Updated 2 years ago
- ☆22 Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016) ☆46 Updated 6 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆118 Updated 3 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is … ☆74 Updated 4 months ago
- ☆120 Updated 2 years ago
- Deep reinforcement learning of mahjong self-play ☆17 Updated 6 years ago
- C++/Python Fight the Landlord (Dou Dizhu) with pybind11 (reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020 ☆160 Updated 3 years ago