georgesung / deep_rl_acrobotLinks

TensorFlow A2C to solve Acrobot, with synchronized parallel environments

☆35

Alternatives and similar repositories for deep_rl_acrobot

Users that are interested in deep_rl_acrobot are comparing it to the libraries listed below

Sorting:

flyyufelix / Direct-Future-Prediction-Keras
Direct Future Prediction (DFP ) in Keras
☆109Updated 7 years ago
siemanko / guided-policy-search
Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).
☆43Updated 10 years ago
Breakend / RLSSContinuousControlTutorial
Tutorial on continuous control at Reinforcement Learning Summer School 2017.
☆34Updated 8 years ago
avisingh599 / imitation-dagger
[Reimplementation Ross et al 2011] An implementation of DAGGER using ConvNets for driving from pixels.
☆77Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55Updated 2 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
isl-org / DirectFuturePrediction
Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
☆151Updated 11 months ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Updated 6 years ago
zuoxingdong / VIN_TensorFlow
TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular
☆52Updated 8 years ago
rll / deeprlhw2
☆24Updated 9 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago
ofirnachum / models
Models built with TensorFlow
☆25Updated 6 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
dbobrenko / async-deeprl
Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning
☆42Updated 7 years ago
Scitator / Run-Skeleton-Run
Reason8.ai PyTorch solution for NIPS RL 2017 challenge
☆84Updated 5 years ago
eparisotto / ActorMimic
Train an RL agent to play multiple Atari games at once
☆69Updated 9 years ago
MOCR / DDPG
reimplementation of the ddpg algorithm using tensorflow
☆38Updated 8 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Updated 7 years ago
marcino239 / pilco
Using Pilco algorithm to find a controller for few robotic problems
☆43Updated 10 years ago
5vision / DARQN
Deep Attention Recurrent Q-Network
☆115Updated 9 years ago
tanmayshankar / RCNN_MDP
Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.
☆69Updated 7 years ago
deepsense-ai / Distributed-BA3C
☆56Updated 2 years ago
tambetm / gymexperiments
☆28Updated 6 years ago
jjkke88 / RL_toolbox
reinfore learning tool box, contains trpo, a3c algorithm for continous action space
☆42Updated 7 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 9 years ago
wojzaremba / trpo
☆101Updated 8 years ago
akolishchak / doom-net-pytorch
Reinforcement learning models in ViZDoom environment
☆131Updated 3 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆152Updated 7 years ago