monoelh / deep-reinforcement-learning_DDQN_PPO_HERLinks

MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.

☆20

Alternatives and similar repositories for deep-reinforcement-learning_DDQN_PPO_HER

Users that are interested in deep-reinforcement-learning_DDQN_PPO_HER are comparing it to the libraries listed below

Sorting:

activatedgeek / torchrl
Highly Modular and Scalable Reinforcement Learning
☆115Updated 5 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47Updated 4 years ago
kazizzad / BDQN-MxNet-Gluon
Efficient Exploration through Bayesian Deep Q-Networks
☆37Updated 7 years ago
jeappen / gym-grid
A simple Gridworld environment for Open AI gym
☆25Updated 7 years ago
JKCooper2 / gym-bandits
Bandits Environments for the OpenAI Gym
☆89Updated 5 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50Updated 5 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 5 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
Anjum48 / rl-examples
Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow
☆103Updated 4 years ago
greydanus / baby-a3c
A high-performance Atari A3C agent in 180 lines of PyTorch
☆171Updated 3 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆53Updated 7 years ago
cair / FlashRL
☆27Updated 3 years ago
Silvicek / distributional-dqn
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…
☆132Updated 6 years ago
cmusjtuliuyuan / RainBow
RainBow, Tensorflow
☆49Updated 7 years ago
brett-daley / dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆23Updated last year
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87Updated 6 years ago
brendanator / atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
☆137Updated last year
flyyufelix / C51-DDQN-Keras
C51-DDQN in Keras
☆126Updated 7 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66Updated 7 years ago
0b01 / CommNet
PyTorch implementation of CommNet
☆36Updated 7 years ago
wwxFromTju / deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Co…
☆127Updated 2 years ago
veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆34Updated 7 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆30Updated 4 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 2 years ago
go2sea / DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Le…
☆132Updated 7 years ago
liampetti / A3C-LSTM
A3C-LSTM algorithm tested on CartPole OpenAI Gym environment
☆48Updated 7 years ago
uidilr / gail_ppo_tf
Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action
☆115Updated 6 years ago
cxxgtxy / deeprl-baselines
Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…
☆35Updated 6 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆71Updated 8 years ago
LuEE-C / PPO-Keras
My implementation of the Proximal Policy Optisation algorithm using Keras as a backend
☆88Updated 5 years ago