semitable / seacLinks
The official code base of Shared Experience Actor-Critic (NeurIPS2020)
☆25Updated last year
Alternatives and similar repositories for seac
Users that are interested in seac are comparing it to the libraries listed below
Sorting:
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆55Updated last year
- The Starcraft Multi-Agent challenge lite☆46Updated last year
- ☆30Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆143Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆202Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆41Updated 5 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆67Updated 4 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆341Updated last year
- Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment☆74Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆160Updated last year
- ☆49Updated 4 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Updated last year
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆46Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 3 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆87Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆204Updated last year
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Updated 3 years ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆43Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- There will be updates later☆88Updated 6 years ago
- ☆100Updated 5 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆88Updated 2 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆112Updated 4 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆368Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆146Updated last year
- The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.☆41Updated 2 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆82Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆111Updated 3 years ago