Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆23 · Jul 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for revisiting_marl
Users that are interested in revisiting_marl are comparing it to the libraries listed below.
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning ☆11 · Oct 18, 2022 · Updated 3 years ago
- A modular implementation of PPO, and soon hopefully other algorithms. ☆26 · Jan 16, 2024 · Updated 2 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm ☆19 · Nov 10, 2022 · Updated 3 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms ☆22 · Dec 29, 2024 · Updated last year
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys… ☆21 · Feb 24, 2023 · Updated 3 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆26 · Nov 29, 2022 · Updated 3 years ago
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning". ☆13 · Jan 25, 2023 · Updated 3 years ago
- Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics. ☆22 · Sep 11, 2023 · Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆88 · Apr 3, 2023 · Updated 3 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data". ☆46 · Oct 31, 2024 · Updated last year
- ☆227 · Jun 4, 2023 · Updated 2 years ago
- Gym wrapper for pysc2 ☆10 · Sep 16, 2022 · Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library ☆38 · May 16, 2023 · Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆18 · Aug 8, 2022 · Updated 3 years ago
- [NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning ☆13 · Oct 19, 2022 · Updated 3 years ago
- ☆25 · Apr 16, 2024 · Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆27 · May 12, 2025 · Updated last year
- Code for experimenting with state and action abstractions in reinforcement learning. ☆29 · Dec 11, 2020 · Updated 5 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI. ☆116 · Jan 16, 2024 · Updated 2 years ago
- The core repository of the elsciRL framework. ☆18 · Dec 8, 2025 · Updated 5 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆32 · Nov 22, 2025 · Updated 6 months ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents. ☆15 · Jan 3, 2023 · Updated 3 years ago
- It's the pytorch implementation of google research football. ☆43 · Jun 14, 2019 · Updated 6 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL ☆29 · Oct 29, 2023 · Updated 2 years ago
- Implements the Messenger environment and EMMA model. ☆25 · Jun 14, 2023 · Updated 2 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies ☆20 · Mar 10, 2021 · Updated 5 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Jun 15, 2023 · Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023) ☆19 · May 10, 2024 · Updated 2 years ago
- Training Multiple agents in the same environment to collaborate and compete with each other ☆12 · Dec 1, 2019 · Updated 6 years ago
- ☆16 · May 5, 2022 · Updated 4 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL ☆45 · May 18, 2026 · Updated last week
- ☆507 · Dec 28, 2023 · Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization ☆83 · Apr 13, 2023 · Updated 3 years ago
- This is the official implementation of Multi-Agent PPO (MAPPO). ☆2,004 · Jul 18, 2024 · Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers. ☆11 · Apr 5, 2023 · Updated 3 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld ☆13 · Jul 13, 2020 · Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆89 · Dec 8, 2022 · Updated 3 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 · Jul 18, 2025 · Updated 10 months ago
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning ☆10 · Nov 14, 2021 · Updated 4 years ago