wisnunugroho21 / asynchronous_impala_PPOLinks

Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation

☆36

Alternatives and similar repositories for asynchronous_impala_PPO

Users that are interested in asynchronous_impala_PPO are comparing it to the libraries listed below

Sorting:

ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
shariqiqbal2810 / REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
☆65Updated 4 years ago
acyclics / MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆27Updated 4 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆170Updated 3 years ago
danielwillemsen / MAMBPO
DecentralizedLearning
☆24Updated 2 years ago
daisatojp / mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆76Updated 2 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
AnujMahajanOxf / MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
☆58Updated 3 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
IouJenLiu / CMAE
☆49Updated 3 years ago
baitingzbt / PEDA
Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023
☆32Updated 6 months ago
oxwhirl / comix
☆44Updated 4 years ago
ymzhang01 / focops
Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).
☆26Updated 3 years ago
sfujim / TD7
Author's PyTorch implementation of TD7 for online and offline RL
☆145Updated last year
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆50Updated last month
wsjeon / maddpg-rllib
MADDPG in Ray/RLlib
☆54Updated 5 years ago
LanqingLi1993 / FOCAL-ICLR
Code for FOCAL Paper Published at ICLR 2021
☆51Updated last year
uoe-agents / lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
☆46Updated 9 months ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
yalidu / liir
Learning Individual Intrinsic Reward in MARL
☆62Updated 2 years ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆49Updated last year
uoe-agents / seac
The official code base of Shared Experience Actor-Critic (NeurIPS2020)
☆39Updated last year
garrett4wade / revisiting_marl
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆22Updated 2 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆51Updated 9 months ago
lich14 / CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆86Updated 2 years ago
alirezakazemipour / DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
☆68Updated last year
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆50Updated 4 years ago
Dragon-Zhuang / BPPO
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆87Updated last year
uoe-agents / smaclite
The Starcraft Multi-Agent challenge lite
☆42Updated 9 months ago