mgbellemare / SkipCTSLinks

Skip Context Tree Switching - Reference Implementation

☆51

Alternatives and similar repositories for SkipCTS

Users that are interested in SkipCTS are comparing it to the libraries listed below

Sorting:

sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 8 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago
Ardavans / DSR
☆96Updated 8 years ago
ShibiHe / Q-Optimality-Tightening
This is my implementation of the Optimality Tightening
☆37Updated 8 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66Updated 5 years ago
wojzaremba / trpo
☆101Updated 8 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
wgrathwohl / BackpropThroughTheVoidRL
RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Base…
☆38Updated 7 years ago
pierrelux / rlss2017
☆17Updated 8 years ago
wojzaremba / trpo_rnn
☆20Updated 9 years ago
ilyasu123 / trpo
☆19Updated 9 years ago
iassael / torch-e2c
Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆42Updated 9 years ago
Kaixhin / rlenvs
Reinforcement learning environments for Torch7
☆91Updated 8 years ago
cilvrRG / RL
Reading Group on Reinforcement Learning topics
☆56Updated 8 years ago
ShibiHe / Model-Free-Episodic-Control
This is the implementation of paper Model Free Episodic Control
☆36Updated 5 years ago
Bonnevie / rebar
A Python implementation of the gradient REBAR estimator.
☆46Updated 7 years ago
rarilurelo / pytorch_a3c
☆38Updated 8 years ago
ericjang / e2c
TensorFlow impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆65Updated 9 years ago
seba-1511 / drl.pth
Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)
☆9Updated 7 years ago
tonywu95 / eval_gen
Evaluation code with models for the paper "On the Quantitative Analysis of Decoder-Based Generative Models"
☆130Updated 7 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
junhyukoh / icml2016-minecraft
Implementation of "Control of Memory, Active Perception, and Action in Minecraft"
☆86Updated 8 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
iassael / torch-bootstrapped-dqn
Torch implementation of "Deep Exploration via Bootstrapped DQN"
☆42Updated 9 years ago
runopti / Learning-To-Learn
TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"
☆84Updated 8 years ago
rll / deeprlhw2
☆24Updated 9 years ago
5vision / DARQN
Deep Attention Recurrent Q-Network
☆115Updated 9 years ago
ThomasMiconi / LearningToLearnBOHP
Backpropagation training of neural networks with Hebbian plastic connections
☆31Updated 4 years ago
strin / curriculum-deep-RL
Design good curriculums for deep reinforcement learning
☆14Updated 9 years ago