sjtu-marl / bd_rd_psro
Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
☆23Updated 3 years ago
Alternatives and similar repositories for bd_rd_psro
Users who are interested in bd_rd_psro are comparing it to the libraries listed below.
- ☆22Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Updated last month
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 3 years ago
- ☆30Updated 4 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆74Updated 3 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- curriculum☆27Updated 2 years ago
- ☆12Updated 5 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆46Updated last year
- PyTorch Implementation of COPA for coordinating teams that can dynamically change.☆22Updated 3 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆67Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Updated last year
- ☆40Updated 3 years ago
- DecentralizedLearning☆25Updated 3 years ago
- ☆42Updated 4 years ago
- Deep Implicit Coordination Graphs☆43Updated last year
- (AAAI24 oral) Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO)☆12Updated 2 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- ☆25Updated 3 years ago
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Updated 5 years ago
- ☆49Updated 4 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>.☆21Updated last month
- code for ROMANCE☆14Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Updated 11 months ago
- ☆16Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Updated 2 years ago
- ☆15Updated 3 years ago