XinJingHao / SAC-Continuous-Pytorch
a clean and robust Pytorch implementation of SAC on continuous action space
☆73Updated last week
Alternatives and similar repositories for SAC-Continuous-Pytorch:
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below
- implementation of MADDPG using PettingZoo and PyTorch☆136Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆126Updated last year
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆57Updated last week
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆99Updated 2 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆77Updated 5 months ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆47Updated 2 years ago
- ☆96Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆43Updated 3 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆166Updated last year
- PyTorch implementations of MADDPG, MAPPO (coming)☆138Updated last year
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆35Updated 6 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆81Updated 4 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆62Updated 2 years ago
- ☆200Updated last year
- The official code releasement of publications in MARL field of TJU RL lab.☆74Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- A clean and robust Pytorch implementation of PPO on continuous action space.☆143Updated 10 months ago
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆318Updated last month
- ☆39Updated 3 weeks ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆36Updated 2 years ago
- TD3 in Pytorch☆31Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆32Updated 2 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆207Updated 5 years ago
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago