FitMachineLearning / PPO-Keras
Keras Implementation of PPO to solve OpenAI Gym Environments
☆16 Updated 7 years ago
Alternatives and similar repositories for PPO-Keras
Users who are interested in PPO-Keras are comparing it to the libraries listed below
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend ☆88 Updated 5 years ago
- C51-DDQN in Keras ☆126 Updated 7 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆107 Updated 6 years ago
- Basic reinforcement learning implementation with TensorFlow version 2.0 ☆52 Updated 5 years ago
- Proximal Policy Optimization (PPO) with Keras Implementation ☆17 Updated 4 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 Updated 4 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆103 Updated 4 years ago
- ☆25 Updated 5 years ago
- Deep recurrent Q Learning using TensorFlow, openai/gym and openai/retro ☆175 Updated 2 years ago
- TensorFlow implementation of Deep Deterministic Policy Gradients ☆19 Updated 8 years ago
- A reinforcement learning framework ☆155 Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆140 Updated 6 years ago
- TensorFlow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆125 Updated 5 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆72 Updated 7 years ago
- General purpose environment wrappers for openai gym ☆25 Updated 5 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 Updated 6 years ago
- ☆72 Updated 2 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow. ☆37 Updated 2 years ago
- ☆15 Updated 5 years ago
- PyTorch implementation of Proximal Policy Optimization ☆51 Updated 7 years ago
- RUDDER: Return Decomposition for Delayed Rewards ☆47 Updated 4 years ago
- An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le… ☆132 Updated 7 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆44 Updated 2 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns. ☆23 Updated 11 months ago
- Simple bit flipping with sparse rewards using HER, similar to the original paper ☆40 Updated 6 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - TensorFlow (no TFLearn dependency), Ornstein Uhlenbeck noise function,… ☆64 Updated 8 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis ☆29 Updated 6 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository ☆45 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆48 Updated 6 years ago