rpSebastian / AutoCFR
Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)
☆16Updated 7 months ago
Related projects
Alternatives and complementary repositories for AutoCFR
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆36Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆46Updated 2 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆27Updated 3 years ago
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 3 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆19Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆17Updated 2 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆17Updated last year
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆45Updated 5 years ago
- Google Research Football MARL Benchmark and Research Toolkit☆34Updated 6 months ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆95Updated 10 months ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆97Updated 2 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- Scalable Implementation of Neural Fictitious Self-Play☆73Updated 5 years ago
- Code for magnetic mirror descent.☆15Updated last year
- Simple implementation of the regret matching algorithm for RPS Nash equilibrium computation via self-play☆24Updated 6 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆52Updated 6 months ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆69Updated last year
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆38Updated 3 weeks ago
- Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO)☆10Updated last year
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆25Updated last year