raincchio / P3OLinks
☆11Updated 10 months ago
Alternatives and similar repositories for P3O
Users that are interested in P3O are comparing it to the libraries listed below
Sorting:
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆24Updated last year
- ☆18Updated 4 years ago
- ☆28Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Updated 2 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆21Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆29Updated 5 months ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆12Updated 3 years ago
- ☆26Updated 3 years ago
- ☆12Updated 4 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- Code for Adapting Environment Sudden Changes by Learning Context Sensitive Policy☆20Updated 3 years ago
- ☆32Updated 2 years ago
- Meta RL codebase for Unstable Baselines☆21Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Updated 3 years ago
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆14Updated last year
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14Updated 4 years ago
- Code for "Learning Structured Communication for Multi-Agent Reinforcement Learning" (ICLR 2020 OpenReview)☆9Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)☆28Updated 3 years ago
- Pytorch implementation of AREL☆15Updated 3 years ago
- ☆12Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆20Updated 3 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆21Updated 3 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆13Updated last year
- Delayed RL agent for non-Atari tasks, from "Acting in Delayed Environments with Non-Stationary Markov Policies", ICLR 2021.☆14Updated last year
- ☆20Updated 4 years ago
- ☆11Updated last year
- Revisiting Discrete Gradient Estimation in MADDPG☆24Updated 2 years ago
- ☆49Updated 3 years ago