RvuvuzelaM / self-attention-ppo-pytorch
I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf
☆35Updated 2 years ago
Alternatives and similar repositories for self-attention-ppo-pytorch
Users who are interested in self-attention-ppo-pytorch are comparing it to the libraries listed below
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆193Updated last year
- The official code release for publications in the MARL field from the TJU RL lab.☆80Updated 3 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆142Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space.☆163Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 4 years ago
- This is the official implementation of Multi-Agent PPO.☆118Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆93Updated last year
- ☆102Updated 3 years ago
- ☆217Updated 2 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆116Updated 2 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆68Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- ☆40Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- A clean and robust Pytorch implementation of SAC on continuous action space☆86Updated 5 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆87Updated 4 years ago
- Some MARL algorithms implemented in PyTorch☆68Updated 4 years ago
- Transformer in RL for decision-making☆100Updated 2 years ago
- Basic reinforcement learning algorithms. Including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆94Updated 4 years ago
- Implementation of PPO Lagrangian in PyTorch☆50Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆169Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆91Updated 2 years ago
- The code for maddpg using pytorch☆170Updated 5 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆91Updated 10 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆168Updated last year
- The pytorch implementation of DGN on grid world and Starcraft☆147Updated 3 years ago
- Implementation for mSAC methods in PyTorch☆42Updated 4 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆50Updated 5 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆60Updated 5 years ago