gingkg / multiagent-particle-envsLinks
多智能体小球强化学习环境
☆18Updated 4 years ago
Alternatives and similar repositories for multiagent-particle-envs
Users that are interested in multiagent-particle-envs are comparing it to the libraries listed below
Sorting:
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 3 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆201Updated 3 years ago
- 多智能体强化学习☆99Updated 6 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆37Updated 3 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 3 years ago
- implementation of MADDPG using PettingZoo and PyTorch☆148Updated last year
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆32Updated 2 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆59Updated 3 years ago
- 多智能体学习库☆20Updated 3 years ago
- A collection of multi agent environments based on OpenAI gym.☆26Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆85Updated 7 months ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆15Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆92Updated 4 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆139Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆90Updated 2 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆19Updated 7 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆30Updated 6 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆86Updated 4 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆81Updated 2 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 4 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆48Updated 3 years ago
- RL algorithms☆142Updated 4 years ago
- ☆43Updated 5 years ago
- Multi-agent Combat Arena (UAV swarm vs UAV swarm)☆138Updated 4 years ago
- ☆13Updated 5 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆53Updated 5 years ago
- RLlib超参数详解(中文)☆18Updated 3 years ago
- The code for maddpg using pytorch☆169Updated 4 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆43Updated 8 months ago