int8 / regret-matchingLinks
Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play
☆25Updated 6 years ago
Alternatives and similar repositories for regret-matching
Users that are interested in regret-matching are comparing it to the libraries listed below
Sorting:
- Fictitious Self-play & Reinforcement Learning☆18Updated 7 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆80Updated 6 years ago
- Reinforcement learning algorithms to play Poker☆14Updated 3 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Counterfactual regret minimization algorithm for Kuhn poker☆172Updated 6 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 6 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆117Updated 10 months ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆51Updated 9 months ago
- Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"☆71Updated 8 years ago
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆19Updated last year
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- ☆30Updated 2 years ago
- ☆18Updated 4 years ago
- Random Network Distillation(RND) algo in Pytorch☆49Updated 6 years ago
- Distributed Deep Reinforcement Learning☆29Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆101Updated 2 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- ☆33Updated 7 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Updated 6 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆148Updated 2 years ago
- ☆52Updated 6 years ago
- ☆76Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated last year
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- ☆21Updated 2 years ago
- ☆92Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated 2 months ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆21Updated 2 years ago