rmsander / marl_ppoLinks

A MARL PPO implementation with tf-agents, configured for the MultiCarRacing-v0 Gym environment.

☆21

Alternatives and similar repositories for marl_ppo

Users that are interested in marl_ppo are comparing it to the libraries listed below

Sorting:

wsjeon / maddpg-rllib
MADDPG in Ray/RLlib
☆54Updated 5 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 4 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 4 years ago
BY571 / SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆52Updated 3 years ago
kantologist / multiagent-sac
Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.
☆36Updated 4 years ago
Felhof / DiscreteSAC
☆40Updated 3 years ago
parametersharingmadrl / parametersharingmadrl
☆28Updated 4 years ago
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆174Updated last year
alirezakazemipour / Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
☆34Updated 3 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 2 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆289Updated 4 years ago
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 2 months ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317Updated 3 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
TJU-DRL-LAB / Multiagent-RL
The official code releasement of publications in MARL field of TJU RL lab.
☆79Updated 2 years ago
zoeyuchao / mappo
This is the official implementation of Multi-Agent PPO.
☆106Updated 2 years ago
TianhongDai / distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
☆62Updated 6 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆141Updated 6 years ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆123Updated 4 years ago
matteokarldonati / Counterfactual-Multi-Agent-Policy-Gradients
PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."
☆58Updated 5 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆51Updated 3 months ago
nisheeth-golakiya / hybrid-sac
Single-file pytorch implementation of hybrid-SAC
☆58Updated 3 years ago
cyoon1729 / Multi-agent-reinforcement-learning
Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG
☆64Updated 5 years ago
AgrawalAmey / safe-explorer
Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]
☆72Updated 6 years ago
oxwhirl / facmac
☆96Updated 3 years ago
Bigpig4396 / PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
☆76Updated 5 years ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆55Updated 3 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
watakandai / hiro_pytorch
Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)
☆110Updated 4 years ago
atavakol / action-branching-agents
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆116Updated 2 years ago