LuEE-C / PPO-KerasLinks

My implementation of the Proximal Policy Optisation algorithm using Keras as a backend

☆88

Alternatives and similar repositories for PPO-Keras

Users that are interested in PPO-Keras are comparing it to the libraries listed below

Sorting:

cyoon1729 / RLcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
☆301Updated last year
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126Updated 7 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆106Updated 6 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆182Updated 6 years ago
greydanus / baby-a3c
A high-performance Atari A3C agent in 180 lines of PyTorch
☆171Updated 4 years ago
flyyufelix / VizDoom-Keras-RL
Reinforcement Learning in Keras on VizDoom
☆143Updated 7 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆360Updated 5 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
Nasdin / ReinforcementLearning-AtariGame
Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…
☆187Updated 10 months ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
Anjum48 / rl-examples
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103Updated 5 years ago
openai / gym-soccer
☆304Updated 2 years ago
Damcy / prioritized-experience-replay
implement of prioritized experience replay
☆159Updated 6 years ago
rgilman33 / simple-A2C-PPO
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
☆102Updated 5 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
hengyuan-hu / rainbow
A PyTorch implementation of Rainbow DQN agent
☆169Updated 7 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254Updated 2 years ago
liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64Updated 8 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
google-research / episodic-curiosity
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆204Updated 4 years ago
mehdiboubnan / Deep-Reinforcement-Learning-applied-to-DOOM
DQN, DDDQN, A3C, PPO, Curiosity applied to the game DOOM
☆87Updated 4 years ago
ray-project / rl-experiments
Keeping track of RL experiments
☆162Updated 2 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆258Updated 9 months ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132Updated 6 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆325Updated 3 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago