ImmanuelXIV / ppo-self-playLinks
Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment
☆20Updated 2 years ago
Alternatives and similar repositories for ppo-self-play
Users that are interested in ppo-self-play are comparing it to the libraries listed below
Sorting:
- ☆39Updated 3 years ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆70Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆66Updated last year
- MATE: the Multi-Agent Tracking Environment.☆43Updated 2 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆44Updated 11 months ago
- MATE: the Multi-Agent Tracking Environment.☆47Updated 2 years ago
- Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"☆61Updated 2 years ago
- Deep Implicit Coordination Graphs☆43Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- ☆43Updated 3 years ago
- jinxinglim / Game-Theoretical-Approaches-in-Multi-Agent-Reinforcement-Learning-Policy-Space-Response-Oracles☆15Updated 6 years ago
- ☆49Updated 4 years ago
- A collection of recent MARL papers☆96Updated 10 months ago
- ☆14Updated 3 years ago
- DecentralizedLearning☆25Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆192Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆68Updated 2 years ago
- Implementations of safe reinforcement learning algorithms☆28Updated last year
- Implementation of DyMA-CL, MARL algorithm☆28Updated 5 years ago
- The Starcraft Multi-Agent challenge lite☆40Updated last year
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆36Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆30Updated 9 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- ☆29Updated 4 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆30Updated 3 years ago
- curriculum☆25Updated 2 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆41Updated 6 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆20Updated 3 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36Updated 4 years ago
- ☆76Updated last year