Steven-Ho / madrl-baselines
Some multiagent deep reinforcement learning algorithms and its PyTorch implementation.
☆11Updated 4 years ago
Related projects: ⓘ
- Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space☆29Updated 2 years ago
- my code for paper Parameterized-DQN☆19Updated 3 years ago
- PyTorch implementation of Constrained Reinforcement Learning for Soft Actor Critic Algorithm☆30Updated 2 years ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆12Updated 3 years ago
- Nash Q Learning☆30Updated 3 years ago
- Multi Agent adaptation of Soft Actor Critic Reinforcement Learning Algorithm☆11Updated 5 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆54Updated 2 years ago
- PyTorch implementation of MATD3☆12Updated 4 years ago
- ☆15Updated 4 years ago
- Jax and Torch Multi-Agent SAC on PettingZoo API☆56Updated last year
- Hybrid action space reinforcement learning algorithms.☆12Updated 3 years ago
- qmix☆21Updated 4 years ago
- Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single…☆48Updated 5 years ago
- ☆35Updated 4 months ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆23Updated 4 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr…☆19Updated last year
- Codes for Paper "Delay-Aware Multi-Agent Reinforcement Learning".☆49Updated 4 years ago
- ☆14Updated 3 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆39Updated 4 years ago
- Implementation of DyMA-CL, MARL algorithm☆22Updated 4 years ago
- ☆35Updated 2 years ago
- A novel preference-driven multi-objective reinforcement learning algorithm using a single policy network that covers the entire preferenc…☆21Updated 10 months ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆72Updated 3 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Updated 2 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆26Updated 3 years ago
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆10Updated 3 years ago
- Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control☆10Updated last year
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆54Updated 2 years ago
- Implementation for mSAC methods in PyTorch☆36Updated 2 years ago
- 多智能体强化学习VDN、QMIX、QTRAN、QPLEX复现☆24Updated last year