FitMachineLearning / PPO-Keras
Keras Implementation of PPO to solve OpenAI Gym Environments
☆16Updated 6 years ago
Alternatives and similar repositories for PPO-Keras:
Users that are interested in PPO-Keras are comparing it to the libraries listed below
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- A simple stochastic OpenAI environment for training RL agents☆88Updated last year
- Proximal Policy Optimization implementation with TensorFlow☆104Updated 6 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆20Updated 7 years ago
- General purpose environment wrappers for openai gym☆24Updated 5 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 4 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆135Updated 6 years ago
- Proximal Policy Optimization(PPO) with Keras Implementation☆17Updated 4 years ago
- Evolving deep neural network agents using Genetic Algorithms☆66Updated 6 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆36Updated 6 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆102Updated 4 years ago
- Reinforcement Learning in Keras on VizDoom☆146Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆59Updated 6 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆182Updated 5 years ago
- implement of prioritized experience replay☆158Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated 2 years ago
- Deep Q Learning via Pytorch☆86Updated 7 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- ☆73Updated 2 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆47Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- ☆16Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 7 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆39Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 3 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- ☆92Updated 4 years ago