alirezakazemipour / Discrete-SAC-PyTorch
PyTorch implementation of the discrete version of Soft Actor-Critic.
☆34 · Updated 3 years ago
Alternatives and similar repositories for Discrete-SAC-PyTorch:
Users who are interested in Discrete-SAC-PyTorch are comparing it to the repositories listed below:
- ☆40 · Updated 3 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆50 · Updated 3 years ago
- A PyTorch implementation of Multi-Agent Soft Actor-Critic ☆39 · Updated 6 years ago
- PyTorch implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 · Updated 4 years ago
- There will be updates later ☆84 · Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆99 · Updated 2 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆58 · Updated 4 years ago
- Value-Decomposition Multi-Agent Actor-Critics ☆40 · Updated 2 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning ☆56 · Updated 2 years ago
- Deep recurrent Q-learning on the CartPole-v1 environment ☆87 · Updated last year
- ☆96 · Updated 3 years ago
- The official code release for publications in the MARL field from the TJU RL lab. ☆69 · Updated 2 years ago
- Implementation of DyMA-CL, a MARL algorithm ☆26 · Updated 4 years ago
- ☆84 · Updated 3 years ago
- PyTorch implementation of Constrained Policy Optimization ☆53 · Updated 3 years ago
- ☆92 · Updated 4 years ago
- Code for Weighted QMIX ☆133 · Updated 4 years ago
- ☆38 · Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆49 · Updated 3 weeks ago
- Code for the paper "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021. ☆40 · Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017) ☆50 · Updated 6 years ago
- Distributional Soft Actor Critic ☆52 · Updated 4 years ago
- Revisiting Discrete Soft Actor-Critic. Accepted by Transactions on Machine Learning Research (TMLR) ☆19 · Updated 4 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21) ☆83 · Updated last year
- Implementation of mSAC methods in PyTorch ☆41 · Updated 3 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC) ☆60 · Updated last year
- Implementation of the policy gradient RL algorithm in PyTorch ☆38 · Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021 ☆65 · Updated 3 years ago
- Learning Individual Intrinsic Reward in MARL ☆62 · Updated 2 years ago
- ☆47 · Updated 4 years ago