Stanford-ILIAD / PantheonRLLinks
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
☆154Updated last year
Alternatives and similar repositories for PantheonRL
Users that are interested in PantheonRL are comparing it to the libraries listed below
Sorting:
- Lightweight multi-agent gridworld Gym environment☆210Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆196Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆135Updated last year
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆210Updated 4 years ago
- Datasets with baselines for Offline MARL.☆181Updated 2 months ago
- Partially Observable Process Gym☆202Updated 4 months ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆324Updated 2 years ago
- ☆239Updated 11 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆360Updated 2 years ago
- ☆268Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆335Updated last year
- Benchmarking RL generalization in an interpretable way.☆166Updated this week
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆109Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆163Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆179Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆325Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆131Updated 3 years ago
- ☆354Updated 3 years ago
- A tool for aggregating and plotting MARL experiment data.☆78Updated 9 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆172Updated 11 months ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆222Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆229Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆150Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆266Updated 2 years ago
- ☆201Updated 2 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize…☆43Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆192Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆77Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆142Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago