zouchangjie / RL-Nash-Q-learning
强化学习中纳什Qlearning 实现矩阵博弈
☆28Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for RL-Nash-Q-learning
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆19Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆57Updated last year
- Nash Q Learning☆30Updated 3 years ago
- Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single…☆49Updated 5 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆31Updated 2 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆25Updated last year
- ☆14Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆48Updated 3 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆22Updated 6 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆39Updated 2 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆36Updated 2 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆38Updated 4 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆13Updated 4 years ago
- my code for paper Parameterized-DQN☆20Updated 3 years ago
- Codes for paper of 'Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning'☆35Updated last year
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆75Updated 3 years ago
- 无人机动态覆盖控制;1. 实现了一个无人机点覆盖环境;2. 给出了无人机连通保持规则;3. 给出了基于MARL的控制算法☆37Updated 2 months ago
- 多智能体强化学习☆83Updated 5 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆42Updated 4 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆55Updated 2 years ago
- scalable multi agents reinforcement learning☆54Updated 6 years ago
- IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG☆17Updated 2 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆49Updated last year
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆178Updated 2 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆28Updated 2 years ago
- pytorch实现的一些MARL算法☆63Updated 3 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆28Updated 2 years ago
- Code for implementing/applying ODM*, PPO, MAAC, IC3Net and PRIMAL (PPO version) on a Multi-Agent gridworld environment.☆29Updated 3 years ago
- Deep Q Network for Multi-agent RL☆15Updated 4 years ago