liziniu / RL-PPO-Keras
Proximal Policy Optimization(PPO) with Keras Implementation
☆17Updated 4 years ago
Alternatives and similar repositories for RL-PPO-Keras:
Users that are interested in RL-PPO-Keras are comparing it to the libraries listed below
- Keras Implementation of PPO to solve OpenAI Gym Environments☆16Updated 6 years ago
- TensorFlow 2.0 for Deep Reinforcement Learning.☆85Updated last year
- This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.☆11Updated 4 years ago
- Proximal Policy Optimization implementation with TensorFlow☆105Updated 6 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow☆181Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated 4 months ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆174Updated 2 years ago
- Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"☆207Updated 3 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆104Updated 5 years ago
- Actor-critic with experience replay☆252Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆195Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆304Updated 3 years ago
- PyTorch implementation of SAC-Discrete.☆298Updated 7 months ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay)☆40Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆94Updated 4 years ago
- An implement of DQfD(Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…☆133Updated 7 years ago
- Random Network Distillation pytorch☆245Updated 6 years ago
- ☆303Updated last year
- An environment of the board game Go using OpenAI's Gym API☆174Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆292Updated last year
- implement of prioritized experience replay☆160Updated 6 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆258Updated 4 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms☆95Updated 4 years ago
- ICML 2018 Self-Imitation Learning☆275Updated 4 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆372Updated 5 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆320Updated 5 years ago