gingkg / multiagent-particle-envs
Multi-agent particle reinforcement learning environment
☆20 Updated 4 years ago
Alternatives and similar repositories for multiagent-particle-envs
Users who are interested in multiagent-particle-envs are comparing it to the libraries listed below.
- Implementation of MADDPG using PettingZoo and PyTorch ☆156 Updated 2 years ago
- Multi-agent reinforcement learning ☆103 Updated 6 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others ☆207 Updated 3 years ago
- A reinforcement learning algorithm library. The code balances performance and simplicity, with few dependencies. ☆104 Updated 3 years ago
- Multi-agent learning library ☆21 Updated 3 years ago
- PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3. ☆504 Updated 2 years ago
- A Collection of Multi-Agent Reinforcement Learning (MARL) Resources ☆250 Updated 3 years ago
- A clean and robust PyTorch implementation of SAC on continuous action space ☆88 Updated 6 months ago
- Reproductions of the multi-agent reinforcement learning algorithms VDN, QMIX, QTRAN, and QPLEX ☆33 Updated 2 years ago
- Port of an implementation of MADDPG to the environment provided by OpenAI (multiagent-particle-envs). ☆20 Updated 7 years ago
- A PyTorch implementation of the multi-agent deep deterministic policy gradient (MADDPG) algorithm ☆363 Updated 4 years ago
- Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat ☆138 Updated 6 months ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning. ☆54 Updated 3 years ago
- ☆43 Updated 5 years ago
- Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX, and VDN. ☆658 Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API ☆92 Updated 11 months ago
- RL algorithms ☆141 Updated 4 years ago
- Lightweight version of MAPPO to help you quickly migrate to your local environment. ☆741 Updated last week
- A clean and robust PyTorch implementation of PPO on continuous action space. ☆165 Updated last year
- Simple and efficient implementation of DQN, DDPG, TD3, SAC, PPO, MADDPG, MATD3, MASAC, MAAC, IPPO, MAPPO, HAPPO, MAT, and MORL ☆126 Updated 3 months ago
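- Air combat confrontation based on reinforcement learning ☆76 Updated 4 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆89 Updated 4 years ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic ☆404 Updated 3 months ago
- Designs an autonomous decision-making system for UAV air combat using a value-function approximation network. The code is currently a preliminary version and will be continually updated and explained in more detail. ☆71 Updated 6 years ago
- The code for MADDPG using PyTorch ☆169 Updated 5 years ago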
- Multi-agent Combat Arena (UAV swarm vs UAV swarm) ☆149 Updated 5 years ago
- RL Dresden Algorithm Suite ☆32 Updated last year
- D3QN PyTorch ☆62 Updated 3 years ago
- Uses the multi-agent Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to find reasonable paths for ships ☆63 Updated 2 years ago
- Tutorial for Reinforcement Learning ☆189 Updated 3 years ago