deligentfool / policy_based_RL
An implementation of the policy gradient RL algorithm in PyTorch
☆38Updated 4 years ago
Alternatives and similar repositories for policy_based_RL
Users that are interested in policy_based_RL are comparing it to the libraries listed below
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- Code for Weighted QMIX☆139Updated 4 years ago
- The code for maddpg using pytorch☆170Updated 4 years ago
- ☆96Updated 4 years ago
- Implementations of various DQN reinforcement learning algorithms in PyTorch☆93Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆180Updated last year
- ☆211Updated 2 years ago
- ☆101Updated 3 years ago
- Some MARL algorithms implemented in PyTorch☆67Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆65Updated 2 years ago
- There will be updates later☆84Updated 6 years ago
- This is the official implementation of Multi-Agent PPO.☆109Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆288Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆53Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆103Updated 3 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆160Updated last year
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆217Updated 6 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆87Updated 4 years ago
- A clean and robust PyTorch implementation of PPO on continuous action space.☆156Updated last year
- ☆76Updated 5 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆14Updated 5 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆65Updated 6 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆140Updated last year
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆106Updated 4 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆142Updated 6 years ago
- ☆47Updated 5 years ago