ikostrikov / pytorch-trpoLinks

PyTorch implementation of Trust Region Policy Optimization

☆441

Alternatives and similar repositories for pytorch-trpo

Users that are interested in pytorch-trpo are comparing it to the libraries listed below

Sorting:

reinforcement-learning-kr / pg_travel
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆369Updated 6 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆496Updated 2 years ago
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆430Updated last year
ikostrikov / pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
☆310Updated 4 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254Updated 2 years ago
WilsonWangTHU / mbbl
☆392Updated 6 years ago
justinjfu / inverse_rl
☆274Updated 7 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504Updated 2 years ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆636Updated 4 years ago
jcwleo / random-network-distillation-pytorch
Random Network Distillation pytorch
☆251Updated 6 years ago
vy007vikas / PyTorch-ActorCriticRL
PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.
☆412Updated 4 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆258Updated 9 months ago
jachiam / cpo
Constrained Policy Optimization
☆322Updated 8 years ago
kchua / handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆451Updated 2 years ago
hungtuchen / pytorch-dqn
Deep Q-Learning Network in pytorch (not actively maintained)
☆403Updated 7 years ago
jeanharb / option_critic
Implementation of the Option-Critic Architecture on the Atari (ALE) environment
☆179Updated 7 years ago
rlcode / per
Prioritized Experience Replay (PER) implementation in PyTorch
☆345Updated 5 years ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
hoangminhle / hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
☆294Updated 7 years ago
ghliu / pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
☆616Updated 6 years ago
MishaLaskin / curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆592Updated 4 years ago
jonasrothfuss / ProMP
Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…
☆238Updated 2 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆360Updated 5 years ago
dgriff777 / rl_a3c_pytorch
A3C LSTM Atari with Pytorch plus A3G design
☆570Updated 2 years ago
yrlu / irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
☆640Updated last year
chingyaoc / pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
☆266Updated 8 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183Updated 7 years ago
Kaixhin / PlaNet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
☆371Updated 3 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259Updated 5 years ago