nric / ProximalPolicyOptimizationKerasLinks

This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.

☆12

Alternatives and similar repositories for ProximalPolicyOptimizationKeras

Users that are interested in ProximalPolicyOptimizationKeras are comparing it to the libraries listed below

Sorting:

LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Updated 5 years ago
FitMachineLearning / PPO-Keras
Keras Implementation of PPO to solve OpenAI Gym Environments
☆16Updated 7 years ago
liziniu / RL-PPO-Keras
Proximal Policy Optimization(PPO) with Keras Implementation
☆17Updated 4 years ago
ChuaCheowHuan / reinforcement_learning
My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Updated 2 years ago
chagmgang / tf2.0_reinforcement_learning
Basic reinforcement learning implementation with tensorflow version 2.0
☆52Updated 5 years ago
shakti365 / soft-actor-critic
TF2 Implementation of the Soft Actor-Critic Algorithm
☆43Updated 2 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
paraschopra / deepneuroevolution
Evolving deep neural network agents using Genetic Algorithms
☆68Updated 6 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆106Updated 6 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
Stable-Baselines-Team / stable-baselines-tf2
[Experimental] TensorFlow 2 version of stable-baselines, temporary repository
☆45Updated 5 years ago
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆51Updated 4 years ago
liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48Updated 7 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
Huixxi / TensorFlow2.0-for-Deep-Reinforcement-Learning
TensorFlow 2.0 for Deep Reinforcement Learning.
☆87Updated last year
sroj / neat-openai-gym
NEAT for Reinforcement Learning on the OpenAI Gym
☆27Updated 2 years ago
archsyscall / DistRL-TensorFlow2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.
☆69Updated 4 years ago
PacktPublishing / Tensorflow-2-Reinforcement-Learning-Cookbook
Tensorflow 2 Reinforcement Learning Cookbook, published by Packt
☆195Updated 2 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
colinskow / move37
Coding Demos from the School of AI's Move37 Course
☆184Updated 6 years ago
cyoon1729 / RLcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
☆301Updated last year
udion / Transformer-RL
Experiments with transformer based RL algorithms
☆22Updated 5 years ago
karush17 / Hierarchical-Attention-Reinforcement-Learning
Hierarchical Attention in Reinforcement Learning for Stock Order Executions
☆30Updated 4 years ago
ngc92 / space-wrappers
General purpose environment wrappers for openai gym
☆26Updated 6 years ago
karush17 / Evolution-Strategies-PyTorch
Implementation of OpenAI's Evolution Strategies in PyTorch.
☆20Updated 5 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
phossen / gym-bubbleshooter
This repository contains the game bubble shooter as a gym environment. Based on: https://github.com/justinmeister/bubbleshooter
☆17Updated 5 years ago
orrivlin / Hindsight-Experience-Replay---Bit-Flipping
Simple bit flipping with sparse rewards using HER, similarly to the original paper
☆39Updated 6 years ago