blavad / marl
Multi-agent reinforcement learning framework
☆31Updated 4 years ago
Alternatives and similar repositories for marl:
Users that are interested in marl are comparing it to the libraries listed below
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆141Updated last year
- qmix☆22Updated 4 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆91Updated last year
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 3 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆112Updated 2 years ago
- Multiagent deep reinforcement learning research project☆27Updated 8 months ago
- BranchingDQN☆49Updated 6 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆101Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆116Updated 3 months ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆130Updated 8 months ago
- ☆83Updated 3 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆202Updated 5 years ago
- Nash Q Learning☆30Updated 4 years ago
- scalable multi agents reinforcement learning☆56Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- The code for maddpg using pytorch☆165Updated 4 years ago
- meta-MADDPG (Python implementation)☆18Updated 6 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆27Updated 6 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆79Updated 4 years ago
- Simple implementation for Constrained Policy Optimization in Pytorch☆15Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆161Updated 10 months ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆46Updated 5 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆38Updated 5 years ago
- multi-agent deep reinforcement learning for networked system control.☆402Updated 4 years ago
- QMIX implemented in TensorFlow 2☆17Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- Multi-objective reinforcement learning deals with finding policies for tasks where there are multiple distinct criteria to optimize for. …☆20Updated 6 years ago
- Code for Weighted QMIX☆129Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆149Updated last year