Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
☆131Updated last year
Related projects ⓘ
Alternatives and complementary repositories for PantheonRL
- Level-based Foraging (LBF): A multi-agent environment for RL☆161Updated 2 months ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆114Updated 3 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆145Updated 2 weeks ago
- ☆201Updated this week
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆117Updated 3 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆117Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆107Updated last year
- Lightweight multi-agent gridworld Gym environment☆198Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆331Updated last year
- Benchmarking RL generalization in an interpretable way.☆133Updated 9 months ago
- Code for MOPO: Model-based Offline Policy Optimization☆171Updated 2 years ago
- An API conversion tool for popular external reinforcement learning environments☆139Updated last month
- ☆218Updated 9 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆157Updated 2 years ago
- Partially Observable Process Gym☆167Updated 4 months ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆40Updated 2 months ago
- (NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation☆204Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆307Updated 3 months ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆113Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year
- Gridworld for MARL experiments☆137Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆153Updated 5 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- ☆332Updated 2 years ago
- ☆190Updated last year
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆149Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆177Updated 2 months ago
- This is a repository for Hidden-utility Self-Play.☆26Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆209Updated last year