ThousandOfWind / DRL-baseline
Reproduces DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete, A2C, A3C
☆19Updated 4 years ago
Related projects
Alternatives and complementary repositories for DRL-baseline
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆31Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆121Updated 5 months ago
- Code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination' (AAAI 2020)☆55Updated 4 years ago
- PyTorch implementation of multiple Deep Reinforcement Learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym☆53Updated 3 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆61Updated last year
- Nash Q-learning in reinforcement learning, applied to matrix games☆28Updated 5 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆40Updated 2 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆43Updated 3 years ago
- a clean and robust Pytorch implementation of SAC on continuous action space☆60Updated 5 months ago
- Nash Q Learning☆30Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆57Updated 2 years ago
- Some MARL algorithms implemented in PyTorch☆64Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆111Updated last year
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆92Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- An implementation of the policy gradient RL algorithm in PyTorch☆37Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others☆179Updated 2 years ago
- ☆71Updated 5 years ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆37Updated 2 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆19Updated 2 years ago
- Code for MADDPG using PyTorch☆162Updated 4 years ago
- demo of multi-agent reinforcement learning algorithms, such as ATT-MADDPG (Modelling the Dynamic Joint Policy of Teammates with Attention…☆50Updated 3 years ago
- qmix☆22Updated 4 years ago
- Simple code for reinforcement learning☆19Updated 4 years ago
- Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x☆133Updated last year
- DSAC; Distributional Soft Actor-Critic☆114Updated 8 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆75Updated 3 years ago
- Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"☆28Updated 7 months ago