proroklab / rllib_differentiable_commsLinks
This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be realized in RLLib. This example serves as a reference implementation and starting point for making RLLib more compatible with such architectures.
☆43Updated last year
Alternatives and similar repositories for rllib_differentiable_comms
Users that are interested in rllib_differentiable_comms are comparing it to the libraries listed below
Sorting:
- Partially Observable Process Gym☆198Updated 3 months ago
- Benchmarking RL generalization in an interpretable way.☆161Updated 3 months ago
- Datasets with baselines for Offline MARL.☆178Updated 3 weeks ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆43Updated this week
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆153Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆148Updated 2 years ago
- ☆237Updated 10 months ago
- A tool for aggregating and plotting MARL experiment data.☆77Updated 7 months ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆191Updated last year
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆122Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆188Updated 3 years ago
- ☆201Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆211Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆331Updated last year
- Representation Learning for RL☆126Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆40Updated last year
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆133Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆103Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 5 months ago
- Conservative Q Learning on top of SAC☆132Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆187Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆182Updated 6 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆131Updated 3 years ago
- ☆282Updated 3 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆48Updated last year
- PyTorch implementation of DreamerV2 model-based RL algorithm☆227Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆221Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆148Updated 2 years ago
- Deep Hierarchical Planning from Pixels☆107Updated 2 years ago