liziniu / RL-PPO-Keras
Proximal Policy Optimization(PPO) with Keras Implementation
☆17Updated 4 years ago
Alternatives and similar repositories for RL-PPO-Keras:
Users that are interested in RL-PPO-Keras are comparing it to the libraries listed below
- Keras Implementation of PPO to solve OpenAI Gym Environments☆16Updated 6 years ago
- This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.☆11Updated 4 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆177Updated 4 months ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- Proximal Policy Optimization implementation with TensorFlow☆105Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆135Updated 6 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆312Updated 4 years ago
- Tensorflow implementation of a Deep Deterministic Policy Gradient (DDPG) network, trained on OpenAI Gym environments.☆22Updated 6 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆304Updated 3 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆39Updated 5 years ago
- implement of prioritized experience replay☆158Updated 6 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 3 months ago
- ☆92Updated 4 years ago
- A pytorch tutorial for DRL(Deep Reinforcement Learning)☆209Updated last year
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆300Updated 11 months ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆103Updated 4 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 7 years ago
- Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"☆207Updated 3 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆132Updated 7 years ago
- Actor-critic with experience replay☆251Updated 2 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Updated 4 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆285Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.☆150Updated last year
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 5 years ago
- Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action☆113Updated 6 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆26Updated 4 years ago