ImmanuelXIV / ppo-self-playLinks
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆21Updated 3 weeks ago
Alternatives and similar repositories for ppo-self-play
Users that are interested in ppo-self-play are comparing it to the libraries listed below
Sorting:
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆72Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆70Updated last year
- MATE: the Multi-Agent Tracking Environment.☆44Updated 2 years ago
- ☆39Updated 3 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles☆16Updated 6 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆43Updated 6 years ago
- MATE: the Multi-Agent Tracking Environment.☆48Updated 2 years ago
- Implementation of DyMA-CL, MARL algorithm☆28Updated 5 years ago
- ☆49Updated 4 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆42Updated 2 years ago
- ☆42Updated 4 years ago
- Implementation of centralized training, centralized execution of Soft Actor-Critic (SAC) on a Tennis multiagent Unity environment.☆40Updated 4 years ago
- DecentralizedLearning☆24Updated 3 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆63Updated 2 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆60Updated last month
- ☆30Updated 4 years ago
- Deep Implicit Coordination Graphs☆43Updated last year
- Value-Decomposition Multi-Agent Actor-Critics☆41Updated 3 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆44Updated 4 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆31Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆74Updated 2 years ago
- Implementations of safe reinforcement learning algorithms☆29Updated last year
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆81Updated 2 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆58Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆56Updated 2 years ago
- Generate expert demonstrations; GAIL(Generative Adversarial Imitation Learning); IRL(Inverse Reinforcement Learning)☆32Updated 4 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆31Updated last month
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆41Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆215Updated last year
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36Updated 4 years ago