jiajunhua / MorvanZhou-Reinforcement-learning-with-tensorflow
☆14Updated 7 years ago
Alternatives and similar repositories for MorvanZhou-Reinforcement-learning-with-tensorflow:
Users who are interested in MorvanZhou-Reinforcement-learning-with-tensorflow are comparing it to the libraries listed below
- Practice of Deep Reinforcement Learning with Keras and gym.☆158Updated 5 years ago
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆308Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆50Updated 2 months ago
- The implementation of LSTM-TD3.☆79Updated 2 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆99Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆89Updated last year
- Implements PPO-clip and PPO-penalty on Atari; claims to be the only open-source implementation of PPO-penalty☆56Updated 6 years ago
- A collection of resources on model-based reinforcement learning, including code links, papers, and slides.☆42Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆143Updated 10 months ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆61Updated 4 years ago
- Proximal Policy Optimization with Tensorflow 2.0☆31Updated 5 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆88Updated last year
- Some reinforcement learning examples implemented in PyTorch☆36Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆10Updated 3 years ago
- Basic reinforcement learning algorithms, including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…☆92Updated 4 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 5 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆19Updated 2 years ago
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆29Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆50Updated 6 years ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆207Updated 5 years ago
- Tutorial for Reinforcement Learning☆185Updated 3 years ago
- An implementation of the policy gradient RL algorithm in PyTorch☆38Updated 4 years ago
- TensorFlow2 Reinforcement Learning☆475Updated 3 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Chinese translation of the official Stable Baselines documentation☆97Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- TD3 in Pytorch☆31Updated 3 years ago
- Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]☆72Updated 5 years ago
- ☆185Updated 3 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆105Updated 5 years ago