kennethyu2017 / ddpg
reinforcement learning ddpg code. follow deepmind papers.
☆60Updated 6 years ago
Alternatives and similar repositories for ddpg:
Users that are interested in ddpg are comparing it to the libraries listed below
- Deep Recurrent Attention Reinforcement Learning in Atari☆83Updated 6 years ago
- Collection of Deep Reinforcement Learning algorithms☆124Updated 7 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆20Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆49Updated 6 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 6 years ago
- Reinforcement Learning in Python☆107Updated 5 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆100Updated 4 years ago
- scalable multi agents reinforcement learning☆54Updated 6 years ago
- some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…☆131Updated 2 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Updated 6 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 7 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆16Updated 6 years ago
- Deep reinforcement learning agents implement by tensorflow https://ghli.org☆53Updated 5 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆27Updated 6 years ago
- Tensorflow + OpenAI Gym implementation of Deep Q-Network (DQN), Double DQN (DDQN), Dueling Network and Deep Deterministic Policy Gradient…☆75Updated 7 years ago
- Repository for codes of 'Deep Reinforcement Learning'☆215Updated 5 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients