YangShengqi / cartpole_ppo_lstm
☆13Updated 5 years ago
Alternatives and similar repositories for cartpole_ppo_lstm
Users that are interested in cartpole_ppo_lstm are comparing it to the libraries listed below
- Deep recurrent Q learning on CartPole-v1 environment☆93Updated last year
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆23Updated 4 years ago
- Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment☆116Updated 2 years ago
- ☆39Updated 3 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆192Updated last year
- Implementations of many sparse-reward algorithms in the Gym Fetch environment☆89Updated 5 years ago
- Some MARL algorithms implemented in PyTorch☆68Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Updated 7 months ago
- PyTorch implementation of Foerster, Jakob N., et al. "Counterfactual multi-agent policy gradients."☆60Updated 5 years ago
- ☆217Updated 2 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36Updated 4 years ago
- ☆102Updated 3 years ago
- ☆42Updated 6 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆40Updated 5 years ago
- Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery☆104Updated 3 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆49Updated 5 years ago
- ☆97Updated 4 years ago
- Multi-agent PPO with noise (97% win rates on Hard scenarios of SMAC)☆68Updated 2 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆39Updated 4 years ago
- Experiments with DDPG/MADDPG/DQN/MADDPG+advantage on OpenAI's open-source MPE environment☆24Updated 7 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆39Updated 2 years ago
- MARLToolkit: The Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT…☆142Updated last year
- ☆76Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆82Updated 2 years ago
- Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.☆71Updated 3 years ago
- qmix☆23Updated 5 years ago
- Code for Weighted QMIX☆140Updated 4 years ago
- The official code release for MARL publications from the TJU RL lab.☆80Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago
- Implementation of DyMA-CL, MARL algorithm☆28Updated 5 years ago