ImmanuelXIV / ppo-self-play
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆20 Updated 2 years ago
Alternatives and similar repositories for ppo-self-play
Users who are interested in ppo-self-play are comparing it to the libraries listed below.
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment ☆70 Updated last year
- Collection of OpenAI parametrized action-space environments. ☆66 Updated 5 months ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆65 Updated last year
- MATE: the Multi-Agent Tracking Environment. ☆48 Updated 2 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic ☆41 Updated 6 years ago
- ☆43 Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆189 Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆68 Updated 2 years ago
- ☆39 Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning" ☆60 Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆112 Updated 4 years ago
- MATE: the Multi-Agent Tracking Environment. ☆43 Updated 2 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆44 Updated 10 months ago
- DSAC; Distributional Soft Actor-Critic ☆131 Updated 7 months ago
- Heterogeneous Multi-Robot Reinforcement Learning ☆52 Updated last year
- Value-Decomposition Multi-Agent Actor-Critics ☆39 Updated 2 years ago
- A collection of recent MARL papers ☆96 Updated 9 months ago
- Deep Implicit Coordination Graphs ☆43 Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆29 Updated 9 months ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta… ☆26 Updated last year
- Implementation of DyMA-CL, MARL algorithm ☆28 Updated 5 years ago
- The official code release of publications in MARL field of TJU RL lab. ☆80 Updated 3 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO) ☆25 Updated 2 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles ☆15 Updated 5 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918… ☆48 Updated 6 years ago
- DecentralizedLearning ☆25 Updated 2 years ago
- Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala) ☆27 Updated 3 years ago
- Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023) ☆45 Updated 9 months ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆104 Updated 3 years ago
- ☆75 Updated last year