qbx2 / PAAC.pytorch
Pytorch implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning https://arxiv.org/abs/1705.04862
☆20Updated 7 years ago
Alternatives and similar repositories for PAAC.pytorch
Users that are interested in PAAC.pytorch are comparing it to the libraries listed below
Sorting:
- Per-session checkpoint for boosting up your research☆12Updated 6 years ago
- Catch game example is translated by TensorFlow☆16Updated 8 years ago
- Density Network Implementations using TensorFlow☆27Updated 6 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 6 years ago
- ☆49Updated 6 years ago
- 📉 A collection of TensorBoard-related utilities (In Progress)☆37Updated 2 years ago
- Tensorflow implementation of Meta-Learning with Temporal Convolutions☆97Updated 7 years ago
- The DQN agent which plays breakout-v0 in gym.openai.com☆11Updated 7 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch☆42Updated 6 years ago
- Summary (in Korean) and python implementation of 'Reinforcement Learning: An Introduction' written by Sutton & Barto☆57Updated 6 years ago
- Tensorflow implementation of A3C algorithm☆46Updated 7 years ago
- Repository for studying distributional rl☆30Updated 3 months ago
- weekly reinforcement learning paper reviews☆32Updated 7 years ago
- PyTorch KR Tutorial Competition 2018☆60Updated 6 years ago
- ☆57Updated 6 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- Modeling uncertainty information in deep learning☆22Updated 7 years ago
- ☆16Updated 7 years ago
- Scalable distributed reinforcement learning agents on kubernetes☆57Updated last year
- Simple implementation of Least Squares Generative Adversarial Networks☆43Updated 7 years ago
- This is a self-contained memory module for the Dynamic Kanerva Machine, as reported in the NIPS 2018 paper: Learning Attractor Dynamics f…☆43Updated 6 years ago
- 강화학습에 대한 기본적인 알고리즘 구현☆116Updated 6 years ago
- ☆32Updated 8 years ago
- ☆10Updated this week
- Sentiment regression using doc2vec on watcha movie review data☆11Updated 9 years ago
- This repository implements the paper, Model-Agnostic Meta-Leanring for Fast Adaptation of Deep Networks.☆16Updated 7 years ago
- ☆18Updated 7 years ago
- This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul.☆20Updated 7 years ago
- Atari-DRQN (keras ver.)☆33Updated 6 years ago