shareeff / PPO

TensorFlow implementation of the proximal policy optimization (PPO) algorithm

☆13

Alternatives and similar repositories for PPO

Users who are interested in PPO are comparing it to the repositories listed below.

liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64 Updated 8 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183 Updated 7 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆107 Updated 6 years ago
sjchoi86 / irl_rocks
Cool Inverse Reinforcement Learning Papers
☆124 Updated 8 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71 Updated 8 years ago
Alfo5123 / Robust-Multitask-RL
Machine Learning Course Project Skoltech 2018
☆108 Updated 6 years ago
go2sea / DQfD
An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…
☆132 Updated 7 years ago
tesslerc / H-DRLN
Hierarchical Deep RL Network
☆31 Updated 8 years ago
vkurenkov / hierarchical-skill-acquisition
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…
☆11 Updated 7 years ago
mrkulk / hierarchical-deep-RL
Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation
☆87 Updated 7 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50 Updated 5 years ago
ermongroup / multiagent-gail
☆83 Updated 6 years ago
ahq1993 / inverse_rl
Adversarial Imitation Via Variational Inverse Reinforcement Learning
☆95 Updated 5 years ago
davidhershey / feudal_networks
An implementation of FeUdal Networks for Hierarchical Reinforcement Learning, as published: https://arxiv.org/abs/1703.01161
☆183 Updated 7 years ago
RLOpensource / Relational_Deep_Reinforcement_Learning
☆49 Updated 6 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆52 Updated 7 years ago
jcwleo / mario_rl
☆69 Updated 6 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278 Updated 5 years ago
younggyoseo / pytorch-acer
PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay (ACER)
☆17 Updated 4 years ago
takoika / PrioritizedExperienceReplay
Yet another prioritized experience replay buffer implementation.
☆48 Updated 2 years ago
reinforcement-learning-kr / rl-montezuma
State-of-the-art deep RL algorithms for Montezuma's Revenge
☆27 Updated 6 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆85 Updated 6 years ago
TianhongDai / distributed-ppo
This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).
☆62 Updated 6 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80 Updated 7 years ago
tegg89 / magnet
MAGNet: Multi-agents control using Graph Neural Networks
☆132 Updated 6 years ago
liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48 Updated 6 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66 Updated 5 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆177 Updated 7 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125 Updated 5 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆95 Updated 2 years ago