kimhc6028 / policy-gradient-importance-sampling

Policy gradient reinforcement learning algorithm with importance sampling

☆32

Alternatives and similar repositories for policy-gradient-importance-sampling

Users who are interested in policy-gradient-importance-sampling are comparing it to the libraries listed below.

go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning (C51-DQN).
☆57 Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96 Updated 7 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152 Updated 7 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆186 Updated 7 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69 Updated 9 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆44 Updated 6 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102 Updated 6 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80 Updated 7 years ago
Kiwoo / distributional_perspective_on_RL
Implementation of A Distributional Perspective on Reinforcement Learning
☆35 Updated 8 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆30 Updated 8 years ago
onlytailei / A3C-PyTorch
PyTorch implementation of Asynchronous Advantage Actor-Critic (A3C) algorithms
☆114 Updated 8 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19 Updated 7 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132 Updated 6 years ago
onlytailei / Value-Iteration-Networks-PyTorch
PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)
☆80 Updated 8 years ago
dai-dao / PPO-Pytorch
Implementation of PPO in PyTorch
☆41 Updated 7 years ago
nikonikolov / rltf
Reinforcement Learning implementations and research prototyping in TensorFlow
☆82 Updated 6 years ago
kimhc6028 / pytorch-noreward-rl
PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80 Updated 6 years ago
TianhongDai / self-imitation-learning-pytorch
PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".
☆66 Updated 6 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 7 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166 Updated 7 years ago
pkumusic / E-DRL
Exploration Strategies for Deep Reinforcement Learning
☆39 Updated 6 years ago
chingyaoc / pytorch-REINFORCE
PyTorch Implementation of REINFORCE for both discrete & continuous control
☆266 Updated 8 years ago
alexis-jacq / Pytorch-DPPO
PyTorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183 Updated 7 years ago
andrewliao11 / pytorch-a3c-mujoco
A3C implementation for MuJoCo Gym environments
☆72 Updated 7 years ago
ikostrikov / pytorch-rl
☆56 Updated 6 years ago
florensacc / snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
☆96 Updated 7 years ago
yrlu / reinforcement_learning
Implementation of selected reinforcement learning algorithms in TensorFlow: A3C, DDPG, REINFORCE, DQN, etc.
☆151 Updated 2 years ago
rlbayes / rllabplusplus
☆159 Updated 8 years ago
hiwonjoon / tf-a3c-gpu
TensorFlow implementation of the A3C algorithm
☆46 Updated 8 years ago
tanmayshankar / RCNN_MDP
Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
☆69 Updated 7 years ago