spring01 / drlbox

Interfacing RL agents with user-definable neural networks and OpenAI-gym environments.

☆12

Alternatives and similar repositories for drlbox

Users who are interested in drlbox compare it to the libraries listed below.

marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175 Updated 2 years ago
cmusjtuliuyuan / RainBow
RainBow, Tensorflow
☆49 Updated 7 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
☆62 Updated 4 years ago
Damcy / prioritized-experience-replay
Implementation of prioritized experience replay
☆159 Updated 6 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181 Updated 6 years ago
liampetti / DDPG
Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…
☆64 Updated 8 years ago
LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optimization algorithm using Keras as a backend
☆88 Updated 5 years ago
jcwleo / mario_rl
☆69 Updated 6 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50 Updated 5 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125 Updated 5 years ago
liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48 Updated 6 years ago
takuseno / ppo
Proximal Policy Optimization implementation with TensorFlow
☆107 Updated 6 years ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environments
☆106 Updated 6 years ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29 Updated 6 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆85 Updated 6 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71 Updated 8 years ago
yrlu / reinforcement_learning
Implementation of selected reinforcement learning algorithms in Tensorflow. A3C, DDPG, REINFORCE, DQN, etc.
☆151 Updated 2 years ago
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126 Updated 7 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆183 Updated 7 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132 Updated 6 years ago
cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35 Updated 6 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning (GAIL) with discrete action
☆115 Updated 6 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆254 Updated 2 years ago
takoika / PrioritizedExperienceReplay
Yet another prioritized experience replay buffer implementation.
☆48 Updated 2 years ago
Anjum48 / rl-examples
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103 Updated 4 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55 Updated 2 years ago
go2sea / DQfD
An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind: Learning from Demonstrations for Real World Reinforcement Le…
☆132 Updated 7 years ago
MattChanTK / ai-gym
Repository of deep learning and robotics related practice projects.
☆43 Updated 5 years ago
stevenpjg / RDPG
Recurrent Deterministic Policy Gradient actor-critic based Reinforcement Learning algorithm in Action
☆37 Updated 4 months ago
stefanbo92 / A3C-Continuous
Tensorflow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action space
☆46 Updated 7 years ago