ehknight / natural-gradient-deep-q-learningLinks

☆22

Alternatives and similar repositories for natural-gradient-deep-q-learning

Users that are interested in natural-gradient-deep-q-learning are comparing it to the libraries listed below

Sorting:

kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆67Updated 7 years ago
JohnLangford / RL_acid
Some hard problems for reinforcement learning.
☆31Updated 6 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
siemens / policy_search_bb-alpha
☆69Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
flowersteam / geppg
☆35Updated 6 years ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Updated 6 years ago
mgbellemare / SkipCTS
Skip Context Tree Switching - Reference Implementation
☆51Updated 7 years ago
openai / baselines-results
☆117Updated 5 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 9 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago
ericjang / e2c
TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65Updated 9 years ago
Ardavans / DSR
☆96Updated 8 years ago
clvrai / FeatureControlHRL-Tensorflow
A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆32Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
flyyufelix / Direct-Future-Prediction-Keras
Direct Future Prediction (DFP ) in Keras
☆109Updated 7 years ago
Breakend / RLSSContinuousControlTutorial
Tutorial on continuous control at Reinforcement Learning Summer School 2017.
☆34Updated 8 years ago
wojzaremba / trpo
☆101Updated 8 years ago
ilyasu123 / trpo
☆19Updated 9 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Updated 7 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆96Updated 6 years ago
steveKapturowski / async-deep-rl
A Tensorflow based implementation of "Asynchronous Methods for Deep Reinforcement Learning": https://arxiv.org/abs/1602.01783
☆7Updated 8 years ago
Scitator / Run-Skeleton-Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
☆84Updated 5 years ago
isl-org / DirectFuturePrediction
Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
☆151Updated 11 months ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago