longtermrisk / marltoolboxLinks
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
☆32Updated 3 years ago
Alternatives and similar repositories for marltoolbox
Users that are interested in marltoolbox are comparing it to the libraries listed below
Sorting:
- Gridworld for MARL experiments☆144Updated 4 years ago
- ☆324Updated last year
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆216Updated 4 years ago
- ☆359Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆80Updated 11 months ago
- ☆201Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- ☆47Updated last year
- Gridworld domains in the gym interface☆29Updated last year
- PAIRED in PyTorch 🔥☆63Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆173Updated last month
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆162Updated 4 years ago
- ☆246Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago
- impact-driven-exploration☆132Updated 2 years ago
- Repo for reproduction of sequential social dilemmas☆409Updated 9 months ago
- A gym interface for AI safety gridworlds created in pycolab.☆18Updated 3 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆157Updated 2 years ago
- SocialJax: sequential social dilemma environments☆56Updated last month
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆92Updated 4 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆20Updated 3 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- Submission for MAVEN: Multi-Agent Variational Exploration☆59Updated 3 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆236Updated 2 years ago
- Learning to Incentivize Other Learning Agents☆35Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆160Updated 2 years ago