xihuai18 / A2PO-ICLR2023
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32 · Nov 22, 2025 · Updated 2 months ago
Alternatives and similar repositories for A2PO-ICLR2023
Users who are interested in A2PO-ICLR2023 are comparing it to the repositories listed below:
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl… ☆22 · Jan 22, 2024 · Updated 2 years ago
- (AAAI24 oral) Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO) ☆12 · May 22, 2023 · Updated 2 years ago
- ☆222 · Jun 4, 2023 · Updated 2 years ago
- ☆16 · Oct 6, 2019 · Updated 6 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh… ☆173 · Jan 7, 2024 · Updated 2 years ago
- we're building an AI to play the board game Diplomacy! ☆35 · Mar 27, 2022 · Updated 3 years ago
- ☆33 · Dec 8, 2022 · Updated 3 years ago
- ☆481 · Dec 28, 2023 · Updated 2 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut… ☆42 · Oct 14, 2023 · Updated 2 years ago
- Mirror Descent Policy Optimization ☆42 · Oct 31, 2020 · Updated 5 years ago
- Official pytorch implementation of the paper <Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts>. ☆21 · Nov 22, 2025 · Updated 2 months ago
- A collection of deep reinforcement learning algorithm implementations ☆11 · Jan 9, 2020 · Updated 6 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga… ☆11 · Dec 1, 2022 · Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆76 · Jun 9, 2023 · Updated 2 years ago
- ☆25 · Feb 21, 2022 · Updated 3 years ago
- ☆49 · Jul 23, 2021 · Updated 4 years ago
- ☆11 · Apr 23, 2021 · Updated 4 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆87 · Jul 15, 2022 · Updated 3 years ago
- The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning' ☆15 · Oct 6, 2024 · Updated last year
- RLA is a tool for managing your RL experiments automatically ☆32 · Jan 11, 2025 · Updated last year
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- Code accompanying paper "Coordinated Proximal Policy Optimization" ☆11 · Mar 26, 2022 · Updated 3 years ago
- ☆13 · Jul 9, 2018 · Updated 7 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC ☆41 · Jul 24, 2025 · Updated 6 months ago
- V-MPO torch version with DMLab30 and GTrXL ☆13 · Mar 1, 2021 · Updated 4 years ago
- Scalable MCTS for team scenarios ☆16 · Jun 14, 2024 · Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings ☆15 · Nov 14, 2023 · Updated 2 years ago
- ☆13 · Nov 22, 2022 · Updated 3 years ago
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021 ☆13 · Nov 3, 2021 · Updated 4 years ago
- Companion code to TRC paper: Daniel A. Lazar, Erdem Bıyık, Dorsa Sadigh, Ramtin Pedarsani. "Learning how to Dynamically Route Autonomous … ☆16 · Aug 9, 2021 · Updated 4 years ago
- [AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition ☆38 · Jun 3, 2024 · Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments. ☆368 · Mar 16, 2023 · Updated 2 years ago
- ☆12 · Aug 15, 2020 · Updated 5 years ago
- code for ROMANCE ☆14 · Oct 12, 2024 · Updated last year
- Lexicographic Multi-Objective Reinforcement Learning ☆16 · May 15, 2023 · Updated 2 years ago
- Official implementation of HARL algorithms based on PyTorch. ☆856 · Apr 27, 2025 · Updated 9 months ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable"). ☆34 · Dec 8, 2023 · Updated 2 years ago
- [NeurIPS' 24] The PyTorch implementation of our paper: "Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learnin… ☆21 · Oct 10, 2024 · Updated last year
- Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication fo… ☆21 · Apr 23, 2024 · Updated last year