divyahansg / RecurrentDPGLinks

CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)

☆11

Alternatives and similar repositories for RecurrentDPG

Users that are interested in RecurrentDPG are comparing it to the libraries listed below

Sorting:

eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆67Updated 5 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆26Updated 5 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆95Updated 2 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50Updated last week
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125Updated 5 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆15Updated 6 years ago
xtma / dsac
Distributional Soft Actor Critic
☆55Updated 5 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆50Updated last month
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆31Updated 2 years ago
orrivlin / MountainCar_DQN_RND
Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)
☆40Updated 6 years ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Updated 5 years ago
TonghanWang / DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
☆52Updated 2 years ago
adithya-subramanian / Multi_Agent_Soft_Actor_Critic
A Pytorch Implementation of Multi Agent Soft Actor Critic
☆40Updated 6 years ago
stevenpjg / RDPG
Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action
☆37Updated 4 months ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago
lweitkamp / feudalnets-pytorch
PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.
☆41Updated 5 years ago
Hwhitetooth / lirpg
☆61Updated 7 years ago
ciwang / policydistillation
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Updated 5 years ago
hu-po / pySACQ
PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)
☆37Updated 4 years ago
SaminYeasar / Off_Policy_Adversarial_Inverse_Reinforcement_Learning
Implementation of Off Policy Adversarial Inverse Reinforcement Learning
☆23Updated 4 years ago
twni2016 / f-IRL
Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020
☆45Updated last year