instadeepai / og-marl
Datasets with baselines for offline multi-agent reinforcement learning.
167 stars, updated this week
Alternatives and similar repositories for og-marl
Users who are interested in og-marl compare it to the repositories listed below.
- Elegant implementations of offline safe RL algorithms in PyTorch (202 stars, updated 8 months ago)
- Level-based Foraging (LBF): A multi-agent environment for RL (180 stars, updated 8 months ago)
- Author's PyTorch implementation of TD7 for online and offline RL (143 stars, updated last year)
- A tool for aggregating and plotting MARL experiment data. (77 stars, updated 3 months ago)
- (no description; 245 stars, updated last year)
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning (162 stars, updated 10 months ago)
- Multi-objective Gymnasium environments for reinforcement learning (318 stars, updated 2 months ago)
- The Starcraft Multi-Agent challenge lite (42 stars, updated 8 months ago)
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 (128 stars, updated 9 months ago)
- A fast safe reinforcement learning library in PyTorch (186 stars, updated 7 months ago)
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers (51 stars, updated 2 years ago)
- Prioritized Experience Replay implementation with proportional prioritization (77 stars, updated last year)
- (no description; 203 stars, updated last year)
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… (135 stars, updated last year)
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment (46 stars, updated 8 months ago)
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). (170 stars, updated last year)
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO (167 stars, updated 3 years ago)
- Benchmarks for Multi-Objective Multi-Agent Decision Making (89 stars, updated 2 months ago)
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP… (111 stars, updated last year)
- Datasets and env wrappers for offline safe reinforcement learning (91 stars, updated 8 months ago)
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. (167 stars, updated 6 months ago)
- Clean baseline implementation of PPO using an episodic TransformerXL memory (178 stars, updated 10 months ago)
- (no description; 229 stars, updated 5 months ago)
- A collection of recent MARL papers (91 stars, updated 5 months ago)
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL (359 stars, updated 3 years ago)
- Benchmarking RL generalization in an interpretable way. (156 stars, updated 2 months ago)
- (no description; 111 stars, updated 2 years ago)
- Baseline implementation of recurrent PPO using truncated BPTT (142 stars, updated last year)
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity. (85 stars, updated 2 years ago)
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). (86 stars, updated last year)