msinto93 / D4PGLinks

Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym environments.

☆126

Alternatives and similar repositories for D4PG

Users that are interested in D4PG are comparing it to the libraries listed below

Sorting:

nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆106Updated 6 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259Updated 5 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318Updated 3 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76Updated 3 years ago
createamind / DRL
☆92Updated 4 years ago
wwxFromTju / deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…
☆127Updated 2 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆83Updated 7 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316Updated 2 years ago
hoangminhle / hierarchical_IL_RL
Code for hierarchical imitation learning and reinforcement learning
☆294Updated 7 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 2 weeks ago
iclavera / learning_to_adapt
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
☆214Updated 2 years ago
navuboy / gail_gym
Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
☆89Updated 6 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
vitchyr / multiworld
Multitask Environments for RL
☆278Updated 3 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
jachiam / cpo
Constrained Policy Optimization
☆322Updated 8 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆289Updated 4 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504Updated 2 years ago
jangirrishabh / Overcoming-exploration-from-demos
Implementation of the paper "Overcoming Exploration in Reinforcement Learning with Demonstrations" Nair et al. over the HER baselines fro…
☆154Updated 3 years ago
uber-research / ape-x
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Updated 6 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
jcwleo / random-network-distillation-pytorch
Random Network Distillation pytorch
☆251Updated 6 years ago
YuhangSong / Arena-BuildingToolkit
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆84Updated 4 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆258Updated 9 months ago