ThousandOfWind / DRL-baseline
Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C
☆19Updated 4 years ago
Alternatives and similar repositories for DRL-baseline:
Users that are interested in DRL-baseline are comparing it to the libraries listed below
- simple code to reinforcement learning☆19Updated 4 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆45Updated 2 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆30Updated 6 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr…☆34Updated last year
- Nash Q Learning☆31Updated 4 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆59Updated 4 years ago
- Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"☆32Updated 11 months ago
- Transformer-based Multi-Agent Actor-Critic Framework☆44Updated 2 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆72Updated 4 months ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆57Updated 3 years ago
- Simple implementation for Constrained Policy Optimization in Pytorch☆15Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆63Updated 3 years ago
- Some multiagent deep reinforcement learning algorithms and its PyTorch implementation.☆11Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last month
- reinforcement learning algorithm for multi-objective optimization problem☆16Updated 4 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆96Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆79Updated 4 years ago
- A clean and robust implementation of Prioritized DQN and Prioritized Double DQN☆19Updated 9 months ago
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆42Updated 3 years ago
- Algorithm that combines QMIX with SAC for Multi-Agent Reinforcement Learning.☆46Updated 2 years ago
- 一些利用pytorch编程实现的强化学习例子☆36Updated 5 years ago
- 多智能体学习库☆18Updated 3 years ago
- Paper list for constrained policy optimization in reinforcement learning.