stevenpjg / RDPGLinks

Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action

☆37

Alternatives and similar repositories for RDPG

Users that are interested in RDPG are comparing it to the libraries listed below

Sorting:

fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆84Updated 6 years ago
jachiam / cpo
Constrained Policy Optimization
☆322Updated 8 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64Updated 8 years ago
divyahansg / RecurrentDPG
CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)
☆11Updated 8 years ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
hoangminhle / hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
☆293Updated 7 years ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
wwxFromTju / deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…
☆127Updated 2 years ago
theophilegervet / options-hierarchical-rl
☆26Updated 7 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆15Updated 6 years ago
jangirrishabh / toyCarIRL
Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinf…
☆176Updated 3 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆107Updated 6 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago
rohan-sawhney / multi-agent-rl
☆77Updated 7 years ago
mynkpl1998 / Recurrent-Deep-Q-Learning
Solving POMDP using Recurrent networks
☆87Updated 5 years ago
anagabandi / nn_dynamics
☆344Updated 7 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317Updated 3 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316Updated 2 years ago
iclavera / learning_to_adapt
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆214Updated 2 years ago
vaishak2future / sac
Implementation of Soft Actor Critic
☆37Updated 3 years ago
gopala-kr / DRL-Agents
research and implementations of Deep RL agents and their applications
☆54Updated 2 weeks ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
atavakol / action-branching-agents
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆117Updated 2 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
aijunbai / taxi
Hierarchical Online Planning and Reinforcement Learning on Taxi
☆30Updated 7 years ago