YangShengqi / cartpole_ppo_lstm
☆13Updated 4 years ago
Alternatives and similar repositories for cartpole_ppo_lstm:
Users that are interested in cartpole_ppo_lstm are comparing it to the libraries listed below
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆40Updated 6 years ago
- Implementation of DyMA-CL, MARL algorithm☆26Updated 4 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆46Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆49Updated last month
- Project on multi agent reinforcement learning applied on patrolling agents☆39Updated 5 years ago
- ☆38Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆33Updated 5 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆87Updated last year
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆47Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆99Updated 2 years ago
- There will be updates later☆84Updated 5 years ago
- ☆84Updated 3 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆14Updated 4 years ago
- ☆96Updated 3 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆22Updated 6 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆40Updated 2 years ago
- Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning☆56Updated 2 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆40Updated 3 years ago
- The official code releasement of publications in MARL field of TJU RL lab.☆71Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆63Updated 3 years ago
- This repository is the official implementation of Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor N…☆44Updated 4 years ago
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆22Updated 3 years ago
- PyTorch implementation of discrete version of Soft Actor-Critic.☆34Updated 3 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Updated 2 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- ☆52Updated 6 years ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- MiniMax Multi-Agent Deep Deterministic Policy Gradient (M3DDPG) pytorch implementation☆10Updated 4 years ago
- ☆40Updated 3 years ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆58Updated 4 years ago