zamlz / dlcampjeju2018-I2A-cube

Applying Imagination-Augmented Agents for Deep Reinforcement Learning to the Rubik's Cube

☆16

Alternatives and similar repositories for dlcampjeju2018-I2A-cube:

Users that are interested in dlcampjeju2018-I2A-cube are comparing it to the libraries listed below

veronicachelu / meta-learning
Meta Reinforcement Learning Experiments
☆34Updated 7 years ago
hiwonjoon / tf-a3c-gpu
Tensorflow implementation of A3C algorithm
☆46Updated 7 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19Updated 6 years ago
reinforcement-learning-kr / rl-montezuma
The state-of-art deep rl algorithms for Montezuma's revenge
☆25Updated 6 years ago
Feryal / automated-curriculum-rl
☆31Updated 6 years ago
Nat-D / FeatureControlHRL
Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆80Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆96Updated 7 years ago
symoon94 / DRQN-keras
Atari-DRQN (keras ver.)
☆33Updated 6 years ago
wulfebw / hierarchical_rl
hierarchical deep reinforcement learning algorithms
☆41Updated 7 years ago
TianhongDai / self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
☆66Updated 6 years ago
Santara / RAIL
Codebase of Santara et. al., RAIL: Risk Averse Imitation Learning, Published in AAMAS 2018
☆14Updated 3 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆56Updated 7 years ago
Feryal / craft-env
☆44Updated 6 years ago
lmb-freiburg / td-or-not-td
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12Updated 6 years ago
arnomoonens / yarll
Combining deep learning and reinforcement learning.
☆80Updated 3 years ago
dylandjian / retro-contest-sonic
World Models applied to the Open AI Sonic Retro Contest
☆77Updated 6 years ago
shagunsodhani / memory-augmented-self-play
PyTorch implementation of Memory Augmented Self-Play
☆50Updated 4 years ago
lnpalmer / PPO
PyTorch implementation of Proximal Policy Optimization
☆51Updated 7 years ago
andrewliao11 / pytorch-a3c-mujoco
Implement A3C for Mujoco gym envs
☆72Updated 7 years ago
pkumusic / E-DRL
Exploration Strategies for Deep Reinforcement Learning
☆39Updated 6 years ago
rlcode / paper-reviews
weekly reinforcement learning paper reviews
☆32Updated 7 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166Updated 7 years ago
jeanharb / a2oc_delib
A3C style Option-Critic with deliberation cost
☆39Updated 7 years ago
illidanlab / cdrl
Collaborative Deep Reinforcement Learning
☆31Updated 7 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆50Updated 6 years ago
clvrai / FeatureControlHRL-Tensorflow
A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆32Updated 7 years ago
reinforcement-learning-kr / reinforcement-learning-pytorch
Minimal and Clean Reinforcement Learning Examples in PyTorch
☆42Updated 6 years ago
mbhenaff / EEN
EEN: Error Encoding Network
☆66Updated 7 years ago
sjchoi86 / irl_rocks
Cool Inverse Reinforcement Learning Papers
☆124Updated 8 years ago
kvfrans / parallel-trpo
A parallel version of Trust Region Policy Optimization
☆65Updated 8 years ago