machine-teaching-group / neurips2022_exploration-guided-reward-shaping
☆14Updated 2 years ago
Alternatives and similar repositories for neurips2022_exploration-guided-reward-shaping
Users that are interested in neurips2022_exploration-guided-reward-shaping are comparing it to the libraries listed below
Sorting:
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 5 months ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆74Updated 5 months ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆75Updated last year
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- ☆46Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆55Updated last year
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆46Updated last year
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆54Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆107Updated 3 years ago
- ☆42Updated 3 years ago
- Deep Implicit Coordination Graphs☆41Updated 11 months ago
- Constrained Policy Optimization implementation on Safety Gym☆27Updated 3 years ago
- ☆38Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- There will be updates later☆84Updated 6 years ago
- ☆93Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- ☆16Updated 2 years ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆41Updated 6 months ago
- Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)☆35Updated 5 years ago
- ☆96Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆155Updated last year
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- CORRO code☆35Updated 2 years ago