Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
☆134Updated last year
Alternatives and similar repositories for PantheonRL:
Users that are interested in PantheonRL are comparing it to the libraries listed below
- Lightweight multi-agent gridworld Gym environment☆201Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆169Updated 4 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆107Updated last year
- ☆211Updated 2 months ago
- ☆231Updated 11 months ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆194Updated 3 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆67Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆121Updated 5 months ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆160Updated last week
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆341Updated last year
- Gridworld for MARL experiments☆138Updated 4 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆118Updated 3 years ago
- Multi Task RL Baselines☆232Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆138Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆210Updated 8 months ago
- Benchmarking RL generalization in an interpretable way.☆138Updated 11 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆65Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆158Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆160Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆173Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆154Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆124Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆193Updated 2 years ago
- ☆243Updated 3 years ago
- Conservative Q Learning on top of SAC☆122Updated 2 years ago
- ☆338Updated 2 years ago
- Partially Observable Process Gym☆174Updated 6 months ago
- Code for the paper "Phasic Policy Gradient"☆259Updated last year
- ☆106Updated last year
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆118Updated last year