longtermrisk / marltoolboxLinks
A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).
☆32Updated 2 years ago
Alternatives and similar repositories for marltoolbox
Users that are interested in marltoolbox are comparing it to the libraries listed below
Sorting:
- Gridworld for MARL experiments☆141Updated 4 years ago
- ☆311Updated 9 months ago
- A gym interface for AI safety gridworlds created in pycolab.☆18Updated 3 years ago
- PAIRED in PyTorch 🔥☆63Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.☆77Updated 8 months ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆209Updated 4 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆148Updated 2 years ago
- ☆237Updated 10 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆109Updated 2 years ago
- ☆49Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆19Updated 2 years ago
- ☆201Updated 2 years ago
- Partially Observable Process Gym☆199Updated 3 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 2 years ago
- Learning to Incentivize Other Learning Agents☆34Updated 3 years ago
- Benchmarking RL generalization in an interpretable way.☆162Updated 3 months ago
- Gridworld domains in the gym interface☆29Updated 11 months ago
- ☆351Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface☆38Updated 2 years ago
- Simple gym environments for safety in Reinforcement Learning Research☆18Updated last year
- Object Centric Atari games☆89Updated 2 weeks ago
- Repo for reproduction of sequential social dilemmas☆407Updated 6 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆227Updated 2 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆194Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆89Updated 4 years ago
- The Implementation of "Machine Theory of Mind", ICML 2018☆24Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆188Updated 3 years ago
- impact-driven-exploration☆132Updated last year
- The Starcraft Multi-Agent challenge lite☆40Updated last year
- SocialJax: sequential social dilemma environments☆45Updated 2 weeks ago