NadeemWard / pytorch_simple_policy_gradientsLinks

Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.

☆13

Alternatives and similar repositories for pytorch_simple_policy_gradients

Users that are interested in pytorch_simple_policy_gradients are comparing it to the libraries listed below

Sorting:

johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆55Updated 3 months ago
Pervasive-AI-Lab / crlmaze
Continual Reinforcement Learning in 3D Non-stationary Environments
☆38Updated 6 years ago
uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆30Updated 2 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
Adaptive-RL / AdaRL-code
Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…
☆38Updated last year
tessavdheiden / social_empowerment
☆17Updated 11 months ago
kaixin96 / mixreg
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆33Updated 4 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
Miffyli / rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30Updated 5 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30Updated 5 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago
bonniesjli / DQN_SR
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Updated 6 years ago
google-deepmind / active_ops
☆32Updated 10 months ago
dannysdeng / dqn-pytorch
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Updated 5 years ago
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆37Updated 4 years ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45Updated 4 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35Updated 4 years ago
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆47Updated 4 years ago
geyang / e-maml
E-MAML, and RL-MAML baseline implemented in Tensorflow v1
☆16Updated 5 years ago
amazon-science / replay-based-recurrent-rl
Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"
☆34Updated 2 years ago
pokaxpoka / netrand
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆54Updated 5 years ago
Steven-Ho / VALOR
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10Updated 6 years ago
Hwhitetooth / lirpg
☆61Updated 7 years ago
oist-cnru / Variational-Recurrent-Models
Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL…
☆55Updated 4 years ago
google-research / deep_ope
☆86Updated 10 months ago
mklissa / phi_gcn
Reward Propagation using Graph Convolutional Networks
☆13Updated 4 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92Updated last week
ElisevanderPol / symmetrizer
☆31Updated 4 years ago
guptav96 / BDQN-PyTorch
Efficient Exploration through Bayesian Deep-Q Networks.
☆17Updated 3 years ago
google-research / dice_rl
☆104Updated 10 months ago