Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
☆123Updated 10 months ago
Related projects: ⓘ
- Level-based Foraging (LBF): A multi-agent environment for RL☆152Updated this week
- Datasets with baselines for offline multi-agent reinforcement learning.☆125Updated this week
- Gridworld for MARL experiments☆137Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆108Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆193Updated 11 months ago
- ☆192Updated 7 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆107Updated last year
- ☆197Updated 7 months ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆161Updated last week
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆150Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆128Updated 7 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆108Updated last year
- This is a repository for Hidden-utility Self-Play.☆26Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆325Updated last year
- Partially Observable Process Gym☆158Updated 2 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆104Updated last year
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆112Updated 2 years ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆200Updated 4 months ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆169Updated 2 years ago
- ☆328Updated last year
- 🚀 A fast safe reinforcement learning library in PyTorch☆150Updated last week
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆293Updated 3 weeks ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated 10 months ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆64Updated 9 months ago
- ☆187Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆146Updated last year
- A tool for aggregating and plotting MARL experiment data.☆57Updated 3 weeks ago