YangShengqi / paper
☆42Updated 2 years ago
Alternatives and similar repositories for paper:
Users that are interested in paper are comparing it to the libraries listed below
- Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning☆40Updated 3 years ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆73Updated 2 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)☆11Updated 5 years ago
- ☆123Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- There will be updates later☆84Updated 5 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆85Updated 4 years ago
- ☆162Updated last year
- Deep Implicit Coordination Graphs☆41Updated 9 months ago
- ☆120Updated 2 years ago
- Assignments for CS294-112.☆30Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- ☆32Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- ☆42Updated 3 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆163Updated 11 months ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆60Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated last year
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆30Updated 3 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆26Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆71Updated 2 years ago
- ☆43Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆131Updated 4 years ago
- ☆38Updated 2 years ago
- ☆28Updated 3 years ago