wchliao / RL_overview_note

Note: "Deep Reinforcement Learning: An Overview"

☆13

Alternatives and similar repositories for RL_overview_note:

Users that are interested in RL_overview_note are comparing it to the libraries listed below

hiwonjoon / tf-a3c-gpu
Tensorflow implementation of A3C algorithm
☆46Updated 7 years ago
wookayin / tensorboard-tools
📉 A collection of TensorBoard-related utilities (In Progress)
☆37Updated 2 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19Updated 6 years ago
titu1994 / tf-eager-examples
A set of simple examples ported from PyTorch for Tensorflow Eager Execution
☆73Updated 6 years ago
rgilman33 / baselines-A2C
[DEPRECATED] Advantage Actor Critic model in PyTorch inspired by OpenAI baselines TensorFlow implementation
☆53Updated 5 years ago
Kiwoo / distributional_perspective_on_RL
Implementation of A Distributional Perspective on Reinforcement Learning
☆35Updated 7 years ago
anirudh9119 / LM_GANS
Professor Forcing, NIPS'16
☆45Updated 8 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
anandsaha / nips.cocob.pytorch
PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting.
☆37Updated 6 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆56Updated 7 years ago
preddy5 / seq2set-keras
implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras
☆13Updated 8 years ago
kimhc6028 / policy-gradient-importance-sampling
Policy gradient reinforcement learning algorithm with importance sampling
☆31Updated 7 years ago
paengs / Net2Net
Tensorflow and Numpy Implementation of Net2Net (http://arxiv.org/abs/1511.05641)
☆47Updated 6 years ago
devsisters / TCML-tensorflow
Tensorflow implementation of Meta-Learning with Temporal Convolutions
☆97Updated 7 years ago
yufengm / Deep-Learning-Notes
Notes for Deep Learning Papers
☆19Updated 6 years ago
jsikyoon / a3c-distributed_tensorflow
Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning
☆29Updated 7 years ago
jayparks / quasi-rnn
A PyTorch Implementation of "Quasi-Recurrent Neural Networks"
☆46Updated 7 years ago
nutszebra / neural_architecture_search_with_reinforcement_learning_appendix_a
Implementation of Appendix A (Neural Architecture Search with Reinforcement Learning: https://arxiv.org/abs/1611.01578) by chainer
☆55Updated 6 years ago
AdeelMufti / DifferentiableNeuralComputer
Optimized Differentiable Neural Computer In Chainer
☆23Updated 6 years ago
ikostrikov / pytorch-rl
☆56Updated 6 years ago
sjchoi86 / deep-uncertainty
Modeling uncertainty information in deep learning
☆22Updated 7 years ago
reinforcement-learning-kr / reinforcement-learning-pytorch
Minimal and Clean Reinforcement Learning Examples in PyTorch
☆42Updated 6 years ago
yunjey / davian-tensorflow
tensorflow tutorial for beginner to intermediate
☆48Updated 7 years ago
carpedm20 / paper-notes
personal notes
☆55Updated 7 years ago
dai-dao / PPO-Pytorch
Implementation of PPO in Pytorch
☆41Updated 7 years ago
aborghi / retro_contest_agent
☆29Updated 6 years ago
google-deepmind / dynamic-kanerva-machines
This is a self-contained memory module for the Dynamic Kanerva Machine, as reported in the NIPS 2018 paper: Learning Attractor Dynamics f…
☆43Updated 6 years ago
ondrejbiza / bandits
Comparison of bandit algorithms from the Reinforcement Learning bible.
☆17Updated 6 years ago
sordonia / zforcing
ZForcing Repo
☆40Updated 7 years ago
kimhc6028 / pathnet-pytorch
PyTorch implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks
☆80Updated 7 years ago