deligentfool / policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
☆38Updated 4 years ago
Alternatives and similar repositories for policy_based_RL:
Users that are interested in policy_based_RL are comparing it to the libraries listed below
- There will be updates later☆84Updated 5 years ago
- Code for Weighted QMIX☆129Updated 4 years ago
- pytorch实现的一些MARL算法☆65Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆115Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆49Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆57Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated 4 years ago
- ☆94Updated 3 years ago
- The implement of all kinds of dqn reinforcement learning with Pytorch☆93Updated 3 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆118Updated 10 months ago
- ☆91Updated 4 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆17Updated 6 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆99Updated 3 years ago
- ☆40Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆83Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆79Updated 4 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆61Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆69Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Updated 5 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆159Updated 10 months ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆149Updated last year
- ☆193Updated last year
- This is the official implementation of Multi-Agent PPO.☆102Updated 2 years ago
- ☆41Updated 3 years ago
- The code for maddpg using pytorch☆165Updated 4 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆38Updated 5 years ago
- ☆83Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆31Updated 3 years ago