FitMachineLearning / PPO-Keras
Keras Implementation of PPO to solve OpenAI Gym Environments
☆16Updated 6 years ago
Alternatives and similar repositories for PPO-Keras:
Users that are interested in PPO-Keras are comparing it to the libraries listed below
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Updated 5 years ago
- C51-DDQN in Keras☆125Updated 7 years ago
- ☆25Updated 5 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆175Updated 2 years ago
- Proximal Policy Optimization implementation with TensorFlow☆105Updated 6 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gym☆40Updated 6 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆102Updated 4 years ago
- ☆16Updated 4 years ago
- TensorFlow implementation of asynchronous advantage actor-critic (A3C)☆39Updated 3 years ago
- This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.☆11Updated 4 years ago
- Proximal Policy Optimization(PPO) with Keras Implementation☆17Updated 4 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆103Updated 5 years ago
- Reinforcement Learning in Keras on VizDoom☆145Updated 7 years ago
- A reinforcement learning framework☆154Updated 6 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated 2 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Updated 7 years ago
- Tensorflow implementation of Deep Deterministic Policy Gradients☆19Updated 7 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆141Updated 3 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 5 years ago
- ☆73Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆48Updated 4 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆104Updated 5 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment☆48Updated 6 years ago
- Hands-on Deep Reinforcement Learning, published by Packt☆69Updated 2 years ago
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks.☆56Updated 6 years ago
- This is the code for "Actor Critic Algorithms" by Siraj Raval on Youtube☆75Updated 7 years ago
- A PyTorch implementation of Rainbow DQN agent☆168Updated 6 years ago
- implement of prioritized experience replay☆159Updated 6 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- A simple stochastic OpenAI environment for training RL agents☆89Updated 2 years ago