coldsummerday / SD-SAC
Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)
☆18Updated this week
Related projects ⓘ
Alternatives and complementary repositories for SD-SAC
- ☆17Updated 9 months ago
- Bayesian Soft Actor Critic☆15Updated last year
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆44Updated 3 years ago
- ☆39Updated 3 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated last year
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆52Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆96Updated 2 years ago
- ☆36Updated 2 years ago
- ☆28Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated last year
- Implementation for mSAC methods in PyTorch☆37Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆29Updated 3 years ago
- Code for ICML2023 accepted paper: Complementary Attention for Multi-Agent Reinforcement Learning.☆16Updated last year
- ☆90Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆62Updated 3 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated last year
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated last year
- There will be updates later☆82Updated 5 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆54Updated last year
- ☆34Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆84Updated last year
- Implementation of DyMA-CL, MARL algorithm☆26Updated 4 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆13Updated 3 years ago
- ☆44Updated 3 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆25Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆57Updated 2 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆55Updated 4 years ago
- ☆42Updated 3 years ago
- The official code base of Shared Experience Actor-Critic (NeurIPS2020)☆33Updated 9 months ago