RvuvuzelaM / self-attention-ppo-pytorch
I used this paper as inspiration: https://arxiv.org/pdf/1904.03367.pdf
☆33Updated 2 years ago
Alternatives and similar repositories for self-attention-ppo-pytorch:
Users who are interested in self-attention-ppo-pytorch compare it to the repositories listed below:
- The official code release for publications in the MARL field from the TJU RL lab.☆74Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆126Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆81Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆61Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆51Updated 3 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- A PyTorch implementation of Multi-Agent Soft Actor-Critic☆40Updated 6 years ago
- ☆39Updated 2 years ago
- ☆96Updated 3 years ago
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…☆48Updated last year
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- A clean and robust PyTorch implementation of SAC for continuous action spaces☆73Updated last week
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- ☆42Updated 3 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for the Multi-Agent Particle Environment☆117Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆102Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆54Updated last year
- The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"☆41Updated 3 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment☆89Updated last year
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆22Updated 4 years ago
- ☆40Updated 3 years ago
- Implementation of DyMA-CL, a MARL algorithm☆27Updated 5 years ago
- Implementation of PPO Lagrangian in PyTorch☆44Updated 2 years ago
- IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DyMA-CL, G2ANet, and MADDPG☆18Updated 3 years ago
- Collection of OpenAI parameterized action-space environments.☆64Updated last month
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆55Updated last year
- A collection of recent MARL papers☆88Updated 5 months ago