XinJingHao / SAC-Continuous-PytorchLinks
a clean and robust Pytorch implementation of SAC on continuous action space
☆89Updated 7 months ago
Alternatives and similar repositories for SAC-Continuous-Pytorch
Users that are interested in SAC-Continuous-Pytorch are comparing it to the libraries listed below
Sorting:
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆105Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆165Updated last year
- Jax and Torch Multi-Agent SAC on PettingZoo API☆93Updated 11 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆89Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆201Updated last year
- implementation of MADDPG using PettingZoo and PyTorch☆158Updated 2 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆94Updated 2 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆146Updated last year
- DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic☆413Updated 4 months ago
- TD3 in Pytorch☆35Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Updated 8 months ago
- Simple and efficient implementation of DQN DDPG TD3 SAC PPO MADDPG MATD3 MASAC MAAC IPPO MAPPO HAPPO MAT MORL☆128Updated 4 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆56Updated 3 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆41Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆95Updated last year
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆128Updated 4 months ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆56Updated 3 years ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆64Updated 2 years ago
- The implementation of LSTM-TD3.☆85Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆174Updated last year
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆222Updated 6 years ago
- This is the official implementation of Multi-Agent PPO.☆122Updated 2 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆54Updated 3 years ago
- DSAC; Distributional Soft Actor-Critic☆133Updated 9 months ago
- UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D☆106Updated last year
- ☆106Updated 4 years ago
- A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm☆365Updated 4 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆94Updated 4 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆34Updated 2 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆161Updated 2 years ago