machine-teaching-group / neurips2022_exploration-guided-reward-shapingLinks
☆14Updated 2 years ago
Alternatives and similar repositories for neurips2022_exploration-guided-reward-shaping
Users that are interested in neurips2022_exploration-guided-reward-shaping are comparing it to the libraries listed below
Sorting:
- ☆215Updated 2 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆159Updated last year
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆181Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆80Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆80Updated 3 years ago
- The offcial implementation of "ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind" (ICLR 2022) .☆68Updated 9 months ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆60Updated 5 years ago
- ☆97Updated 4 years ago
- This is the official implementation of Multi-Agent PPO.☆113Updated 2 years ago
- PyTorch implementation of Constrained Policy Optimization☆55Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆105Updated 3 years ago
- This is the official implementation of ERL-Re2.☆65Updated last year
- ☆90Updated 3 weeks ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆141Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- ☆41Updated 3 years ago
- ☆101Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆59Updated 5 years ago
- 🚀 A fast safe reinforcement learning library in PyTorch☆204Updated 10 months ago
- ☆17Updated 2 years ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆118Updated last month
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition☆35Updated last year
- Code for Weighted QMIX☆138Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆69Updated last year
- Implementation of PPO Lagrangian in PyTorch☆50Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting)☆71Updated 2 years ago
- Multi-Agent Reinforcement Learning (MARL) papers☆268Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 7 months ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆33Updated last year