int8 / regret-matching
Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play
☆25Updated 6 years ago
Alternatives and similar repositories for regret-matching:
Users that are interested in regret-matching are comparing it to the libraries listed below
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 6 months ago
- Scalable Implementation of Neural Fictitous Self-Play☆75Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- ☆18Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆114Updated 8 months ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆70Updated 8 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆170Updated 6 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 2 weeks ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆30Updated 4 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆129Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆146Updated last year
- FEN Code☆37Updated 5 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆99Updated 2 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆33Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 4 years ago
- ☆32Updated 4 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆49Updated 6 years ago
- ☆30Updated 2 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- ☆120Updated 2 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆45Updated 4 years ago