wisnunugroho21 / asynchronous_impala_PPO
Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation
☆35Updated 4 years ago
Alternatives and similar repositories for asynchronous_impala_PPO:
Users that are interested in asynchronous_impala_PPO are comparing it to the libraries listed below
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆42Updated 5 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- DecentralizedLearning☆24Updated 2 years ago
- ☆42Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- ☆47Updated 3 years ago
- The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆39Updated 4 months ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆57Updated 2 years ago
- There will be updates later☆84Updated 5 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆19Updated 3 months ago
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆33Updated 2 weeks ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆132Updated last year
- Learning Individual Intrinsic Reward in MARL☆63Updated 2 years ago
- Distributional Soft Actor Critic☆51Updated 4 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆49Updated 3 years ago
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆40Updated last month
- ☆38Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆62Updated 2 years ago
- ☆29Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆162Updated 2 years ago
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆45Updated 2 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆48Updated 6 months ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆37Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆100Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year