coldsummerday / SD-SAC
Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)
☆19Updated 4 months ago
Alternatives and similar repositories for SD-SAC:
Users that are interested in SD-SAC are comparing it to the libraries listed below
- ☆40Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆50Updated 3 years ago
- ☆20Updated last year
- ☆38Updated 2 years ago
- A clean and robust Pytorch implementation of SAC on discrete action space☆35Updated 5 months ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 4 years ago
- Implementation for mSAC methods in PyTorch☆41Updated 3 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- ☆96Updated 3 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021☆65Updated 3 years ago
- Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Proble…☆47Updated 11 months ago
- Implementation of DyMA-CL, MARL algorithm☆26Updated 4 years ago
- There will be updates later☆84Updated 5 years ago
- Code for Weighted QMIX☆133Updated 4 years ago
- The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆52Updated last year
- pytorch实现的一些MARL算法☆66Updated 3 years ago
- ☆43Updated 4 years ago
- ☆92Updated 4 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 2 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆40Updated 3 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆31Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆60Updated last year
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆71Updated 2 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- This is the official implementation of Multi-Agent PPO.☆104Updated 2 years ago
- ☆28Updated 3 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago