Felhof / DiscreteSAC
☆40 · Updated 3 years ago
Alternatives and similar repositories for DiscreteSAC:
Users that are interested in DiscreteSAC are comparing it to the libraries listed below
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆51 · Updated 3 years ago
- A clean and robust PyTorch implementation of SAC for discrete action spaces ☆35 · Updated 6 months ago
- Code for Weighted QMIX ☆136 · Updated 4 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆102 · Updated 2 years ago
- There will be updates later ☆84 · Updated 5 years ago
- PyTorch implementation of the discrete version of Soft Actor-Critic. ☆33 · Updated 3 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm ☆18 · Updated 2 years ago
- ☆93 · Updated 4 years ago
- ☆96 · Updated 3 years ago
- Single-file PyTorch implementation of hybrid-SAC ☆58 · Updated 3 years ago
- ☆39 · Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic, accepted by Transactions on Machine Learning Research (TMLR) ☆21 · Updated 5 months ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 · Updated 2 months ago
- This is the official implementation of Multi-Agent PPO. ☆105 · Updated 2 years ago
- ☆38 · Updated 3 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment ☆46 · Updated 7 months ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning) ☆107 · Updated 3 years ago
- A PyTorch implementation of Multi-Agent Soft Actor-Critic ☆40 · Updated 6 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆75 · Updated 2 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" (ICML 2021) ☆65 · Updated 3 years ago
- PyTorch implementation of Constrained Policy Optimization ☆54 · Updated 3 years ago
- [NeurIPS 2021] CDS achieves remarkable success on the challenging benchmarks SMAC and GRF by balancing sharing and diversity. ☆85 · Updated 2 years ago
- ☆49 · Updated 3 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment ☆88 · Updated last year
- ☆44 · Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆169 · Updated last year
- ☆75 · Updated 5 years ago
- PyTorch implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆40 · Updated 4 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆59 · Updated 4 years ago
- Code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration" (NeurIPS 2021). ☆40 · Updated 2 years ago