alirezakazemipour / Discrete-SAC-PyTorch
PyTorch implementation of the discrete version of Soft Actor-Critic.
☆36 Updated 3 years ago
Alternatives and similar repositories for Discrete-SAC-PyTorch
Users who are interested in Discrete-SAC-PyTorch are comparing it to the libraries listed below.
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm. ☆53 Updated 3 years ago
- ☆40 Updated 3 years ago
- Value-Decomposition Multi-Agent Actor-Critics ☆39 Updated 2 years ago
- Code for Weighted QMIX ☆138 Updated 4 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L). ☆181 Updated last year
- ☆101 Updated 3 years ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG ☆65 Updated 6 years ago
- A PyTorch Implementation of Multi Agent Soft Actor Critic ☆40 Updated 6 years ago
- This is the official implementation of Multi-Agent PPO. ☆113 Updated 2 years ago
- ☆89 Updated 3 years ago
- Deep recurrent Q learning on CartPole-v1 environment ☆91 Updated last year
- There will be updates later ☆84 Updated 6 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery ☆105 Updated 3 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for Multi-Agent Particle Environment ☆117 Updated 2 years ago
- ☆48 Updated 5 years ago
- Some MARL algorithms implemented in PyTorch ☆67 Updated 4 years ago
- Implementation of DyMA-CL, MARL algorithm ☆27 Updated 5 years ago
- The official code release of publications in the MARL field of TJU RL lab. ☆79 Updated 3 years ago
- Revisiting Discrete Soft Actor-Critic, accepted by Transactions on Machine Learning Research (TMLR) ☆23 Updated 8 months ago
- ☆215 Updated 2 years ago
- The code for MADDPG using PyTorch ☆170 Updated 4 years ago
- PyTorch implementation of Constrained Policy Optimization ☆55 Updated 3 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces" ☆219 Updated 6 years ago
- PyTorch implementation of MATD3 ☆13 Updated 5 years ago
- ☆76 Updated 5 years ago
- ☆97 Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER) ☆51 Updated 5 months ago
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings ☆48 Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data. ☆106 Updated 4 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients." ☆59 Updated 5 years ago