semitable / seacLinks
The official code base of Shared Experience Actor-Critic (NeurIPS2020)
☆23Updated last year
Alternatives and similar repositories for seac
Users that are interested in seac are comparing it to the libraries listed below
Sorting:
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆48Updated 10 months ago
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆70Updated 10 months ago
- The Starcraft Multi-Agent challenge lite☆41Updated 10 months ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆39Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆165Updated last year
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 4 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆329Updated 11 months ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆188Updated 10 months ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- ☆28Updated 3 years ago
- A collection of recent MARL papers☆94Updated 8 months ago
- Gridworld for MARL experiments☆141Updated 4 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆81Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆171Updated 8 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆130Updated last year
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 2 years ago
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆40Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆151Updated last year
- Lightweight multi-agent gridworld Gym environment☆209Updated last year
- ☆49Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆181Updated last year
- Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)☆115Updated 2 years ago
- There will be updates later☆84Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- Deep Implicit Coordination Graphs☆41Updated last year
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆77Updated 7 months ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆358Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆180Updated 2 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆105Updated 3 years ago