dragen1860 / PPO-PytorchLinks

Pytorch Implementation of Proximal Policy Optimization Algorithm

☆20

Alternatives and similar repositories for PPO-Pytorch

Users that are interested in PPO-Pytorch are comparing it to the libraries listed below

Sorting:

TianhongDai / self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
☆66Updated 6 years ago
tpbarron / pytorch-ppo
Proximal Policy Optimization in PyTorch
☆39Updated 7 years ago
dai-dao / PPO-Pytorch
Implementation of PPO in Pytorch
☆41Updated 7 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆53Updated 7 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102Updated 6 years ago
Alfo5123 / Robust-Multitask-RL
Machine Learning Course Project Skoltech 2018
☆108Updated 6 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆52Updated 4 years ago
cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Updated 6 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆44Updated 6 years ago
ShangtongZhang / DistributedES
Distributed implementation of popular evolutionary methods
☆64Updated 7 years ago
facebookresearch / M3RL
Mind-aware Multi-agent Management Reinforcement Learning
☆82Updated 6 years ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Updated 6 years ago
facebookresearch / modeling_long_term_future
Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future
☆50Updated 6 years ago
onlytailei / A3C-PyTorch
PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
☆114Updated 8 years ago
JunhongXu / ppo-pytorch
☆20Updated 7 years ago
itaicaspi / mgail
Model-Based Generative Adversarial Imitation Learning
☆89Updated 4 years ago
senya-ashukha / quantile-regression-dqn-pytorch
A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning
☆96Updated 4 years ago
andrewliao11 / pytorch-a3c-mujoco
Implement A3C for Mujoco gym envs
☆72Updated 7 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago
facebookresearch / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Updated 5 years ago
vik-goel / MOREL
☆45Updated last year
camigord / DRL_papernotes
Notes and comments about Deep Reinforcement Learning papers
☆77Updated 7 years ago
veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆34Updated 7 years ago
clvrai / FeatureControlHRL-Tensorflow
A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆32Updated 7 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆186Updated 7 years ago
floodsung / meta-critic-networks
Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning
☆57Updated 7 years ago
activatedgeek / torchrl
Highly Modular and Scalable Reinforcement Learning
☆116Updated 5 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49Updated 7 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278Updated 5 years ago
mbhenaff / EEN
EEN: Error Encoding Network
☆66Updated 7 years ago