longtermrisk / marltoolboxLinks
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
☆32Updated 3 years ago
Alternatives and similar repositories for marltoolbox
Users that are interested in marltoolbox are comparing it to the libraries listed below
Sorting:
- Gridworld for MARL experiments☆144Updated 5 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆82Updated 2 weeks ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆156Updated 2 years ago
- ☆250Updated last year
- Benchmarking RL generalization in an interpretable way.☆173Updated 2 months ago
- ☆326Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Updated 3 years ago
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- SocialJax: sequential social dilemma environments☆65Updated 2 months ago
- Gridworld domains in the gym interface☆30Updated last year
- The Implementation of "Machine Theory of Mind", ICML 2018☆26Updated 3 years ago
- Official Repository for "Agent Modelling under Partial Observability for Deep Reinforcement Learning"☆41Updated 3 years ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆218Updated 4 years ago
- The Starcraft Multi-Agent challenge lite☆46Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆163Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆104Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆51Updated last year
- ☆47Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- ☆202Updated 2 years ago
- Repo for reproduction of sequential social dilemmas☆412Updated 11 months ago
- PAIRED in PyTorch 🔥☆64Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Updated 3 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- ☆31Updated 3 years ago
- impact-driven-exploration☆133Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Partially Observable Process Gym☆211Updated 7 months ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆156Updated 4 years ago