yata0 / Mahjong
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Mahjong
- advantage actor-critic reinforcement learning for openai gym cartpole☆64Updated 7 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 2 months ago
- ☆135Updated 3 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 4 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆35Updated 5 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆127Updated last year
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆24Updated 6 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- ☆33Updated 6 years ago
- Learning to Incentivize Other Learning Agents☆31Updated 2 years ago
- ☆97Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆110Updated 3 years ago
- ☆80Updated 4 months ago
- PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)☆153Updated 5 years ago
- ☆12Updated 3 years ago
- Efficient Reinforcement Learning with a Thought-Game for StarCraft☆46Updated last year
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆149Updated last year
- This project is implementation code of AlphaStar☆187Updated 9 months ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- Bomberman deep reinforcement learning challenge in PyTorch☆24Updated 5 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆36Updated 3 years ago
- Keeping track of RL experiments☆159Updated last year
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 5 years ago
- ☆18Updated 5 years ago
- ☆10Updated 6 years ago
- CommNet and BiCnet implementation in tensorflow☆54Updated 6 years ago
- ☆32Updated last year