Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
☆157Updated 2 years ago
Alternatives and similar repositories for PantheonRL
Users who are interested in PantheonRL are comparing it to the libraries listed below.
- Lightweight multi-agent gridworld Gym environment☆212Updated 2 years ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆216Updated 4 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆139Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆199Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆363Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆173Updated last month
- ☆246Updated last year
- Datasets with baselines for Offline MARL.☆193Updated last month
- OpenAI Gym wrapper for the DeepMind Control Suite☆226Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆236Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆167Updated 3 years ago
- ☆359Updated 3 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆330Updated 2 years ago
- Gridworld for MARL experiments☆144Updated 4 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆182Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆340Updated last year
- ☆281Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments.☆132Updated 4 years ago
- Partially Observable Process Gym☆209Updated 6 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆157Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆321Updated last year
- ☆201Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆190Updated 3 years ago
- Multi Task RL Baselines☆258Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆268Updated 5 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆200Updated last year
- The Starcraft Multi-Agent challenge lite☆43Updated last year