indigoLovee / TD3
TD3 in Pytorch
☆29Updated 3 years ago
Alternatives and similar repositories for TD3:
Users that are interested in TD3 are comparing it to the libraries listed below
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆94Updated 2 years ago
- PyTorch implementations of MADDPG, MAPPO (coming)☆102Updated 10 months ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆114Updated 8 months ago
- Use Multi-agent Twin Delayed Deep Deterministic Policy Gradient(TD3) algorithm to find reasonable paths for ships☆54Updated 2 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆66Updated 7 months ago
- implementation of MADDPG using PettingZoo and PyTorch☆122Updated last year
- Solve BipedalWalkerHardcore-v2 with TD3☆83Updated last year
- Implement reinforcement learning algorithms in Pytorch☆33Updated 3 years ago
- Use Multi-Agent Deep Deterministic Policy Gradient(DDPG) algorithm to find reasonable paths for ships☆30Updated 2 years ago
- The implementation of LSTM-TD3.☆72Updated last year
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆52Updated 3 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆41Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆83Updated last year
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆17Updated 6 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆13Updated 4 years ago
- RL algorithms☆140Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆42Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆79Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆47Updated 4 years ago
- 无人机动态覆盖控制;1. 实现了一个无人机点覆盖环境;2. 给出了无人机连通保持规则;3. 给出了基于MARL的控制算法☆44Updated 4 months ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆68Updated last month
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆28Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆60Updated 2 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆32Updated 2 years ago
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆29Updated 2 years ago
- D3QN Pytorch☆57Updated 3 years ago
- UAV Logistics Environment for Multi-Agent Reinforcement Learning / Unity ML-Agents / Unity 3D☆86Updated 9 months ago
- This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks".☆72Updated 4 years ago
- POMG algorithm for large-scale pursuit game with partial observation and no communication.☆20Updated last year
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆139Updated last year