proroklab / rllib_differentiable_comms

This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be realized in RLLib. This example serves as a reference implementation and starting point for making RLLib more compatible with such architectures.

☆43

Alternatives and similar repositories for rllib_differentiable_comms

Users who are interested in rllib_differentiable_comms are comparing it to the repositories listed below.

proroklab / popgym
Partially Observable Process Gym
☆196 Updated last month
instadeepai / og-marl
Datasets with baselines for Offline MARL.
☆176 Updated 2 weeks ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130 Updated last year
automl / CARL
Benchmarking RL generalization in an interpretable way.
☆159 Updated last month
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆329 Updated 11 months ago
instadeepai / marl-eval
A tool for aggregating and plotting MARL experiment data.
☆77 Updated 6 months ago
moratodpg / imp_marl
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
☆42 Updated 10 months ago
ffelten / CrazyRL
JAX and PZ RL envs + algorithms for swarms of CrazyFlies
☆79 Updated 11 months ago
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆188 Updated 10 months ago
uoe-agents / lb-foraging
Level-Based Foraging (LBF): A multi-agent reinforcement learning environment
☆48 Updated 10 months ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆209 Updated last year
Farama-Foundation / D4RL-Evaluations
☆199 Updated 2 years ago
kandouss / marlgrid
Gridworld for MARL experiments
☆141 Updated 4 years ago
Stanford-ILIAD / PantheonRL
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…
☆152 Updated last year
openrlbenchmark / openrlbenchmark
☆235 Updated 8 months ago
jbr-ai-labs / mamba
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
☆58 Updated 4 months ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆182 Updated 3 years ago
uoe-agents / smaclite
The Starcraft Multi-Agent challenge lite
☆41 Updated 10 months ago
ini / multigrid
Fast and flexible multi-agent gridworld reinforcement learning environments.
☆43 Updated 4 months ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆207 Updated 10 months ago
instadeepai / awesome-marl
A categorised list of Multi-Agent Reinforcement Learning (MARL) papers
☆53 Updated 2 years ago
ruizhaogit / maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
☆28 Updated 2 years ago
uoe-agents / pressureplate
Repo for the multi-agent PressurePlate environment
☆16 Updated 3 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆102 Updated 9 months ago
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆65 Updated 3 years ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆175 Updated 3 years ago
ikostrikov / implicit_q_learning
☆279 Updated 3 years ago
yardenas / la-mbda
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
☆35 Updated 2 years ago
luchris429 / model-free-opponent-shaping
Code for Model-Free Opponent Shaping (ICML 2022)
☆19 Updated 2 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183 Updated last year