BY571 / SAC_discreteLinks
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
☆53Updated 3 years ago
Alternatives and similar repositories for SAC_discrete
Users that are interested in SAC_discrete are comparing it to the libraries listed below
Sorting:
- ☆101Updated 3 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆117Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆180Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆91Updated last year
- ☆40Updated 3 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆72Updated 3 years ago
- This is the official implementation of Multi-Agent PPO.☆109Updated 2 years ago
- ☆211Updated 2 years ago
- MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT..…☆140Updated last year
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆65Updated 2 years ago
- pytorch实现的一些MARL算法☆67Updated 4 years ago
- Code for Weighted QMIX☆139Updated 4 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆79Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 5 years ago
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆160Updated last year
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆103Updated 3 years ago
- Implementation of DyMA-CL, MARL algorithm☆27Updated 5 years ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆48Updated 5 years ago
- Implementation for mSAC methods in PyTorch☆42Updated 3 years ago
- ☆85Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆36Updated 3 years ago
- ☆39Updated 2 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics…☆87Updated 4 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆163Updated last year
- ☆96Updated 4 years ago
- Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation☆115Updated last week
- Implementation of PPO Lagrangian in PyTorch☆49Updated 2 years ago
- DSAC; Distributional Soft Actor-Critic☆129Updated 5 months ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆85Updated last year
- ☆60Updated 4 years ago