miyosuda / unreal
Reinforcement learning with unsupervised auxiliary tasks
☆417Updated 5 years ago
Alternatives and similar repositories for unreal:
Users that are interested in unreal are comparing it to the libraries listed below
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆659Updated 4 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Updated 4 years ago
- Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)☆401Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆418Updated last year
- Collection of Deep Reinforcement Learning algorithms☆298Updated 5 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)☆212Updated 6 years ago
- Implementations of deep RL papers and random experimentation☆177Updated 6 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆698Updated 6 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Updated 6 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆274Updated 6 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow☆193Updated 6 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment☆265Updated 6 years ago
- Implementation of Meta-RL A3C algorithm☆403Updated 7 years ago
- Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)☆319Updated 4 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆341Updated 6 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆395Updated 6 years ago
- Value Iteration Networks☆290Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆266Updated 5 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆189Updated 5 years ago
- Implementation of TRPO and related algorithms☆624Updated 6 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆259Updated 3 months ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…☆214Updated 5 years ago
- A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" agent that works reasonably well☆92Updated 7 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆206Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆275Updated 4 years ago
- TensorFlow implementation of the Value Iteration Networks (NIPS '16) paper☆553Updated 5 years ago
- ☆409Updated 6 years ago
- Code for the paper "Evolved Policy Gradients"☆248Updated 6 years ago
- Tensorflow implementation of deep Q networks in paper 'Playing Atari with Deep Reinforcement Learning'☆163Updated 7 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆307Updated 3 years ago