Kaixhin / end-to-endLinks

Presentation on End-to-End Training of Deep Visuomotor Policies

☆9

Alternatives and similar repositories for end-to-end

Users that are interested in end-to-end are comparing it to the libraries listed below

Sorting:

miyosuda / episodic_control
Model-Free Episodic Control
☆14Updated 8 years ago
iassael / torch-e2c
Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
☆43Updated 9 years ago
5vision / DARQN
Deep Attention Recurrent Q-Network
☆115Updated 9 years ago
junhyukoh / nips2015-action-conditional-video-prediction
Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games"
☆114Updated 9 years ago
rarilurelo / pytorch_a3c
☆38Updated 8 years ago
strin / curriculum-deep-RL
Design good curriculums for deep reinforcement learning
☆14Updated 9 years ago
andrewliao11 / pytorch-a3c-mujoco
Implement A3C for Mujoco gym envs
☆72Updated 7 years ago
iassael / torch-policy-gradient
Deterministic Policy Gradient using torch7
☆43Updated 9 years ago
MOCR / DDPG
reimplementation of the ddpg algorithm using tensorflow
☆38Updated 8 years ago
Ardavans / DSR
☆96Updated 8 years ago
ShibiHe / Model-Free-Episodic-Control
This is the implementation of paper Model Free Episodic Control
☆36Updated 5 years ago
rarilurelo / pcl_keras
reinforcement learning. policy gradient. PCL
☆37Updated 8 years ago
kimhc6028 / pytorch-noreward-rl
pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆80Updated 6 years ago
openai / rosbridge
[deprecated] Bridge from Gym to ROS robots
☆73Updated 2 years ago
sudeepraja / Model-Free-Episodic-Control
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆55Updated 9 years ago
iassael / torch-bootstrapped-dqn
Torch implementation of "Deep Exploration via Bootstrapped DQN"
☆42Updated 9 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
bstadie / third_person_im
third person imitation learning. Archival only.
☆76Updated 5 years ago
isl-org / DirectFuturePrediction
Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017
☆151Updated 11 months ago
marcino239 / pilco
Using Pilco algorithm to find a controller for few robotic problems
☆43Updated 10 years ago
siemanko / guided-policy-search
Implementation is mostly based on Sergey Levine work (http://www.eecs.berkeley.edu/~svlevine/).
☆43Updated 10 years ago
zuoxingdong / VIN_TensorFlow
TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular
☆52Updated 8 years ago
akolishchak / doom-net-pytorch
Reinforcement learning models in ViZDoom environment
☆131Updated 3 years ago
tgangwani / GA3C-DeepNavigation
Tensorflow implementation of DeepMind paper - "Learning to Navigate in Complex Environments"
☆63Updated 8 years ago
jjkke88 / trpo
trust region policy optimization base on gym and tensorflow, can run in distribution mode
☆15Updated 8 years ago
georgesung / deep_rl_acrobot
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
☆35Updated 7 years ago
StanfordVL / ntp
Neural Task Programming
☆81Updated 7 years ago
rlbayes / rllabplusplus
☆159Updated 8 years ago
ilyasu123 / trpo
☆19Updated 9 years ago
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17Updated 7 years ago