jiajunhua / MorvanZhou-Reinforcement-learning-with-tensorflow
☆14Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for MorvanZhou-Reinforcement-learning-with-tensorflow
- Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]☆300Updated 2 years ago
- Deep recurrent Q learning on CartPole-v1 environment☆75Updated 10 months ago
- The implementation of LSTM-TD3.☆64Updated last year
- 强化学习面试(未完待续)☆32Updated 4 years ago
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence.☆92Updated 2 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 5 years ago
- ☆71Updated 5 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆45Updated 4 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆19Updated 4 years ago
- Tutorial for Reinforcement Learning☆172Updated 2 years ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆28Updated 5 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆449Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆47Updated 6 years ago
- Transplant a implementation of MADDPG to the environment provided by openAI (multiagent-particle-envs).☆19Updated 3 years ago
- Solve BipedalWalkerHardcore-v2 with TD3☆82Updated last year
- Practice of Deep Reinforcement Learning with Keras and gym.☆157Updated 5 years ago
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆39Updated 4 years ago
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆28Updated 2 years ago
- 一些利用pytorch编程实现的强化学习例子☆35Updated 5 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆60Updated 3 years ago
- PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....☆52Updated 4 years ago
- Intelligent control algorithm and simulation environment.☆17Updated 4 years ago
- Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty☆56Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆82Updated 6 years ago
- simple code to reinforcement learning☆19Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆122Updated 5 months ago
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆192Updated 5 years ago
- DSAC; Distributional Soft Actor-Critic☆113Updated 9 months ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆10Updated 3 years ago