int8 / regret-matching
Simple implementation of the regret matching algorithm for computing a Nash equilibrium of rock-paper-scissors (RPS) via self-play
☆25Updated 6 years ago
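As a rough illustration of the technique named in the description, here is a minimal sketch of regret matching for rock-paper-scissors via self-play. It is not the repository's code: the names `payoff`, `strategy_from_regrets`, and `train`, the action encoding, and the seed handling are all assumptions made for this example. Each player samples a move from a strategy proportional to its positive cumulative regrets, and the time-averaged strategy should approach the uniform equilibrium (1/3, 1/3, 1/3).

```python
import random

ACTIONS = 3  # assumed encoding: 0 = rock, 1 = paper, 2 = scissors


def payoff(a, b):
    """Payoff to the player choosing a against b: +1 win, 0 tie, -1 loss."""
    return [0, 1, -1][(a - b) % 3]


def strategy_from_regrets(regrets):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No positive regret yet: fall back to the uniform strategy.
    return [1.0 / ACTIONS] * ACTIONS


def train(iterations, seed=0):
    """Run self-play and return both players' time-averaged strategies."""
    rng = random.Random(seed)
    regrets = [[0.0] * ACTIONS for _ in range(2)]
    strategy_sums = [[0.0] * ACTIONS for _ in range(2)]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        moves = [rng.choices(range(ACTIONS), weights=s)[0] for s in strategies]
        for p in range(2):
            opp = moves[1 - p]
            realized = payoff(moves[p], opp)
            for a in range(ACTIONS):
                # Regret: how much better action a would have done this round.
                regrets[p][a] += payoff(a, opp) - realized
                strategy_sums[p][a] += strategies[p][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    for player, avg in enumerate(train(20000)):
        print(player, [round(x, 3) for x in avg])
```

Note that it is the averaged strategy, not the strategy of the final iteration, that is expected to approximate the equilibrium; the per-iteration strategies can keep cycling.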
Alternatives and similar repositories for regret-matching:
Users who are interested in regret-matching are comparing it to the libraries listed below.
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆47Updated 5 months ago
- Scalable Implementation of Neural Fictitious Self-Play☆74Updated 5 years ago
- Fictitious Self-play & Reinforcement Learning☆19Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆37Updated 3 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- Reinforcement learning algorithms to play Poker☆15Updated 3 years ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 9 months ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆113Updated 6 months ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆101Updated 3 weeks ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆44Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Updated 2 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆28Updated 6 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆29Updated 4 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆19Updated 2 years ago
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆32Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- This code is based on the implementation of http://www.cs.cmu.edu/afs/cs/Web/People/sandholm/potential-aware_imperfect-recall.aaai14.pdf,…☆34Updated 6 years ago
- Neural Fictitious Self-Play in Leduc Hold'em☆10Updated 6 years ago
- CommNet and BiCNet implementation in TensorFlow☆54Updated 6 years ago