wangyy161 / DDPG_CNN_Pendulum_practice
practice
☆9Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for DDPG_CNN_Pendulum_practice
- My DRL library with tensorflow1.14 based on openai spinning-up☆58Updated 3 years ago
- Exploring the performance of Prioritized Experience Replay (PER) with the DDPG+HER scheme on the Fetch Robotics Environemnt☆16Updated 3 years ago
- ReinforcementLearning Learn Play Atari Using DDPG and LSTM.☆20Updated 7 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆19Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆46Updated 5 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆61Updated 6 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year
- ☆16Updated 2 years ago
- Implement many Sparse Reward algorithms in Gym Fetch environment☆81Updated 4 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- RL projects including implementation of DQN/DDPG/MADDPG/BicNet on StarCraft II multi-agent learning environment SMAC☆42Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆93Updated 3 years ago
- Hello😜☆30Updated 4 years ago
- ☆96Updated 4 months ago
- The implement of the policy gradient RL algorithm with pytorch☆37Updated 3 years ago
- Implementation of DDPG+HER on gym robotics environment FetchReach-v1☆32Updated 5 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 5 years ago
- ☆41Updated 2 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆121Updated 5 months ago
- ☆47Updated 4 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆99Updated 3 years ago
- ☆32Updated last year
- ICML 2019 RL for Real Life Workshop: Recurrent MADDPG for Partially Observable and Limited Communication Settings☆40Updated 4 years ago
- ☆21Updated 6 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆145Updated 6 months ago
- ☆56Updated 4 years ago
- ☆71Updated 5 years ago
- ☆47Updated 5 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆27Updated last year