armahmood / totd-rndmdp-experiments

Random MDP experiments on true online TD from a forthcoming work by van Seijen et al. (2015)

☆8

Alternatives and similar repositories for totd-rndmdp-experiments

Users who are interested in totd-rndmdp-experiments are comparing it to the repositories listed below.

ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37 Updated 8 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates:
☆33 Updated 7 years ago
mcmachado / options
☆43 Updated 8 years ago
flowersteam / rl-difference-testing
Simple tools for statistical analyses in RL experiments
☆66 Updated 6 years ago
iosband / TabulaRL
☆65 Updated last year
flowersteam / geppg
☆35 Updated 6 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆29 Updated 7 years ago
bstadie / krazyworld
krazy grid world
☆25 Updated 5 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39 Updated 7 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35 Updated 7 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 Updated 3 years ago
david-abel / rl_abstraction
Code for experimenting with state and action abstractions in reinforcement learning.
☆31 Updated 4 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21 Updated 7 years ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 Updated 5 years ago
KyriacosShiarli / taco
☆25 Updated 6 years ago
sunblaze-ucb / rl-generalization
Modifiable OpenAI Gym environments for studying generalization in RL
☆87 Updated 6 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆28 Updated 8 years ago
ericjang / e2c
TensorFlow implementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65 Updated 9 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55 Updated 2 years ago
lmb-freiburg / td-or-not-td
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12 Updated 6 years ago
ScottJordan / EvaluationOfRLAlgs
This repository contains the code used in the paper Evaluating the Performance of Reinforcement Learning Algorithms
☆27 Updated 3 years ago
Feryal / craft-env
☆44 Updated 6 years ago
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25 Updated 4 years ago
DanielTakeshi / imitation
☆13 Updated 7 years ago
Breakend / OptionGAN
Code accompanying the OptionGAN paper.
☆44 Updated 6 years ago
dtak / hip-mdp-public
Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning
☆32 Updated 7 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 6 years ago
eringrant / spirl-readings
A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.
☆13 Updated 4 years ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 Updated 3 years ago
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17 Updated 7 years ago