XinJingHao / SAC-Continuous-Pytorch
a clean and robust Pytorch implementation of SAC on continuous action space
☆75Updated last month
Alternatives and similar repositories for SAC-Continuous-Pytorch
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below
Sorting:
- implementation of MADDPG using PettingZoo and PyTorch☆140Updated last year
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆128Updated last year
- Jax and Torch Multi-Agent SAC on PettingZoo API☆81Updated 5 months ago
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆64Updated 3 weeks ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆100Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆170Updated last year
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆48Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆51Updated 2 months ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆56Updated 3 years ago
- ☆96Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆145Updated 11 months ago
- The official code releasement of publications in MARL field of TJU RL lab.☆77Updated 2 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆36Updated 6 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆46Updated 3 years ago
- ☆203Updated last year
- This is the official implementation of Multi-Agent PPO.☆106Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆140Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- ☆103Updated 3 months ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆65Updated 3 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆32Updated 2 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆36Updated 3 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆62Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆84Updated 4 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆341Updated last month
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- TD3 in Pytorch☆33Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 3 months ago
- Transformer in RL for decision-making☆97Updated 2 years ago