indylab / tabular_xdo
☆9Updated 3 years ago
Alternatives and similar repositories for tabular_xdo:
Users who are interested in tabular_xdo are comparing it to the libraries listed below.
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆38Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆49Updated 6 months ago
- ☆18Updated 3 years ago
- ☆11Updated 2 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆17Updated 4 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆46Updated 6 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆29Updated 3 years ago
- ☆13Updated 2 years ago
- ☆53Updated last year
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated 2 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Code for magnetic mirror descent.☆16Updated last year
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning"☆21Updated 8 months ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆45Updated 3 years ago
- ☆17Updated last year
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- ☆18Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Updated 4 years ago
- Repo for "Greedy when Sure and Conservative when Uncertain about the Opponents" (GSCU)☆20Updated 2 years ago
- ☆12Updated 4 years ago
- A set of competitive environments for Reinforcement Learning research.☆29Updated 2 years ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆31Updated 2 years ago
- ☆12Updated 3 years ago
- ☆14Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 2 years ago
- PyTorch implementation of Stable Opponent Shaping (https://openreview.net/pdf?id=SyGjjsC5tQ).☆21Updated 5 years ago
- The StarCraft Multi-Agent Challenge lite☆42Updated 6 months ago
- Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)☆16Updated 11 months ago
- ☆74Updated 9 months ago
- PyTorch IMPALA implementation☆26Updated 5 years ago