machine-teaching-group / neurips2022_exploration-guided-reward-shaping
☆13Updated 2 years ago
Alternatives and similar repositories for neurips2022_exploration-guided-reward-shaping:
Users that are interested in neurips2022_exploration-guided-reward-shaping are comparing it to the libraries listed below
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆51Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆69Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- [NeurIPS 2022] Code for Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments☆12Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆26Updated 2 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- This repo is the implementation of paper ''SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning''.☆45Updated last year
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆99Updated 3 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆58Updated 4 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆71Updated 2 months ago
- The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .☆58Updated 3 months ago
- ☆17Updated 2 years ago
- ☆20Updated last year
- ☆42Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆102Updated 2 years ago
- ☆36Updated 2 years ago
- ☆42Updated 2 years ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆20Updated last year
- PyTorch implementation of Constrained Policy Optimization☆51Updated 3 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆26Updated last year
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆12Updated last year
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆51Updated last year
- ☆38Updated 2 years ago
- ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward☆18Updated 2 years ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆75Updated 9 months ago
- Constrained Policy Optimization implementation on Safety Gym☆23Updated 3 years ago
- Code for ICML2023 accepted paper: Complementary Attention for Multi-Agent Reinforcement Learning.☆16Updated last year
- ☆28Updated 3 years ago