longtermrisk / marltoolbox
A toolbox for speeding up research on bargaining (cooperation problems) in MARL.
☆32 Updated 2 years ago
Alternatives and similar repositories for marltoolbox
Users who are interested in marltoolbox are comparing it to the libraries listed below.
- Gridworld for MARL experiments ☆141 Updated 4 years ago
- A gym interface for AI safety gridworlds created in pycolab. ☆18 Updated 3 years ago
- PAIRED in PyTorch 🔥 ☆62 Updated 2 years ago
- Implementation of USFAs: https://arxiv.org/pdf/1812.07626.pdf ☆9 Updated 6 years ago
- ☆306 Updated 7 months ago
- Code for Model-Free Opponent Shaping (ICML 2022) ☆19 Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data. ☆77 Updated 6 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆147 Updated 2 years ago
- The Implementation of "Machine Theory of Mind", ICML 2018 ☆25 Updated 3 years ago
- Gridworld domains in the gym interface ☆28 Updated 10 months ago
- Benchmarking RL generalization in an interpretable way. ☆159 Updated last month
- ☆235 Updated 8 months ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu… ☆208 Updated 4 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆205 Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration ☆58 Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆182 Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 3 years ago
- impact-driven-exploration ☆131 Updated last year
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu… ☆152 Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm ☆223 Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination" ☆109 Updated 2 years ago
- Learning to Incentivize Other Learning Agents ☆34 Updated 3 years ago
- ☆200 Updated 2 years ago
- Partially Observable Process Gym ☆196 Updated 2 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆88 Updated 4 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details ☆46 Updated 3 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation". ☆22 Updated 9 months ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022 ☆329 Updated 11 months ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039) ☆160 Updated 2 years ago
- The Starcraft Multi-Agent challenge lite ☆41 Updated 10 months ago