EnnaSachdeva / Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-RewardsLinks
Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on multiagent cooperative settings which can be model…
☆51Updated 6 years ago
Alternatives and similar repositories for Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards
Users that are interested in Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards are comparing it to the libraries listed below
Sorting:
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆14Updated 3 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr…☆38Updated 2 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- Multi Agent adaptation of Soft Actor Critic Reinforcement Learning Algorithm☆20Updated 6 years ago
- Developed a Multi-Agent DDPG to solve Vehicle Scheduling problem.☆13Updated 2 years ago
- Source code of paper "A Hierarchical Reinforcement Learning Algorithm Based on Attention Mechanism for UAV Autonomous Navigation"☆21Updated 2 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆49Updated 2 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆27Updated 5 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆37Updated 2 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆48Updated 3 years ago
- ☆23Updated 4 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆15Updated 5 years ago
- ☆14Updated 4 years ago
- This repository includes a realization of the resilient projection-based consensus actor-critic algorithm that is resilient to adversaria…☆10Updated 3 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆39Updated 5 years ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment.☆37Updated 4 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆64Updated 2 years ago
- MBaranPeker / Pursuit-Evasion-Game-with-Deep-Reinforcement-Learning-in-an-environment-with-an-obstacleIn this study, a multi agent chase-escape problem using Deep Q learning. Actors of the problem are smart evader and smart pursuers with o…☆23Updated last year
- my code for paper Parameterized-DQN☆22Updated 4 years ago
- POMG algorithm for large-scale pursuit game with partial observation and no communication.☆21Updated 4 months ago
- Decentralized deep multi-agent reinforcement learning in physical environments.☆14Updated 6 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆86Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- Communication-efficient MARL for CACC☆26Updated last year
- Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control☆13Updated 3 years ago
- code of paper 《Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem》☆17Updated 4 years ago
- using actor-critic method to dealing with the path-planning UAV problem☆18Updated 6 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆24Updated 7 years ago