jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29Updated 6 years ago
Alternatives and similar repositories for MultiAgent-PPO:
Users that are interested in MultiAgent-PPO are comparing it to the libraries listed below
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- There will be updates later☆84Updated 5 years ago
- ☆74Updated 9 months ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.☆60Updated 5 years ago
- A Multi-agent Learning Framework☆62Updated 3 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated 7 months ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆117Updated 4 months ago
- Collection of OpenAI parametrized action-space environments.☆62Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆85Updated 4 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- ☆52Updated 6 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆64Updated 5 years ago
- MADDPG in Ray/RLlib☆53Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Updated last week
- ☆48Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆100Updated 3 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆41Updated 7 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated last month
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- ☆92Updated 4 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 7 years ago