mcmachado / TrueOnlineSarsaLinks

Implementations of Sarsa(λ) and True Online Sarsa(λ)

☆9

Alternatives and similar repositories for TrueOnlineSarsa

Users that are interested in TrueOnlineSarsa are comparing it to the libraries listed below

Sorting:

ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37Updated 8 years ago
wulfebw / hierarchical_rl
hierarchical deep reinforcement learning algorithms
☆41Updated 7 years ago
armahmood / totd-rndmdp-experiments
Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)
☆8Updated 9 years ago
wulfebw / playing_atari
learning to play atari games with reinforcement learning
☆10Updated 9 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 8 years ago
miyosuda / episodic_control
Model-Free Episodic Control
☆14Updated 8 years ago
wulfebw / async_rl
Python implementation of tabular asynchronous actor critic
☆11Updated 9 years ago
tambetm / gymexperiments
☆28Updated 6 years ago
pkumusic / E-DRL
Exploration Strategies for Deep Reinforcement Learning
☆39Updated 6 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Updated 7 years ago
flowersteam / geppg
☆35Updated 6 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35Updated 7 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆28Updated 8 years ago
MOCR / DDPG
reimplementation of the ddpg algorithm using tensorflow
☆38Updated 8 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33Updated 7 years ago
5vision / DARQN
Deep Attention Recurrent Q-Network
☆115Updated 9 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
mcmachado / options
☆43Updated 8 years ago
domluna / deep-rl-gym-tutorials
Some code for tutorials following https://gym.openai.com/docs/rl
☆14Updated 8 years ago
ShibiHe / Model-Free-Episodic-Control
This is the implementation of paper Model Free Episodic Control
☆36Updated 5 years ago
EndingCredits / Neural-Episodic-Control
Implementation of Deepmind's Neural Episodic Control
☆58Updated 7 years ago
Riashat / Bayesian-Exploration-Deep-RL
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Updated 7 years ago
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17Updated 7 years ago
geek-ai / 1m-agents
A platform of grid world that supports up to 1 million reinforcement-learning agents.
☆69Updated 7 years ago
siemanko / guided-policy-search
Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).
☆43Updated 10 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23Updated 5 years ago
iassael / torch-bootstrapped-dqn
Torch implementation of "Deep Exploration via Bootstrapped DQN"
☆42Updated 9 years ago