zamlz / dlcampjeju2018-I2A-cube
Applying Imagination-Augmented Agents for Deep Reinforcement Learning to the Rubik's Cube
☆16Updated 6 years ago
Alternatives and similar repositories for dlcampjeju2018-I2A-cube:
Users that are interested in dlcampjeju2018-I2A-cube are comparing it to the libraries listed below
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- Tensorflow implementation of A3C algorithm☆46Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆25Updated 6 years ago
- ☆31Updated 6 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- Atari-DRQN (keras ver.)☆33Updated 6 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Codebase of Santara et. al., RAIL: Risk Averse Imitation Learning, Published in AAMAS 2018☆14Updated 3 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- ☆44Updated 6 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Updated 6 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆51Updated 7 years ago
- Implement A3C for Mujoco gym envs☆72Updated 7 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- weekly reinforcement learning paper reviews☆32Updated 7 years ago
- NIPS 2017 Value Prediction Network☆166Updated 7 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Collaborative Deep Reinforcement Learning☆31Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Updated 7 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Updated 6 years ago
- EEN: Error Encoding Network☆66Updated 7 years ago
- Cool Inverse Reinforcement Learning Papers☆124Updated 8 years ago
- A parallel version of Trust Region Policy Optimization☆65Updated 8 years ago