machine-teaching-group / neurips2022_exploration-guided-reward-shaping
☆14Updated 2 years ago
Alternatives and similar repositories for neurips2022_exploration-guided-reward-shaping
Users who are interested in neurips2022_exploration-guided-reward-shaping are comparing it to the repositories listed below
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆30Updated last year
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆103Updated 2 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- ☆38Updated 3 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆77Updated 2 years ago
- The official code release for publications in the MARL field from the TJU RL lab.☆79Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆56Updated 2 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆14Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆86Updated 2 years ago
- ☆48Updated this week
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆41Updated 3 years ago
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆56Updated last year
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆74Updated 5 months ago
- ☆22Updated last year
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆42Updated 7 months ago
- Code implementation for "Bi-level Actor-Critic for Multi-agent Coordination" (AAAI 2020)☆59Updated 4 years ago
- ☆94Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆110Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 5 months ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆33Updated last year
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- ☆43Updated 3 years ago
- ☆28Updated 3 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆28Updated 2 years ago
- Deep Implicit Coordination Graphs☆41Updated last year
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- There will be updates later☆84Updated 6 years ago
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago