MultiPath / NMT-RDPGLinks
Neural machine translation with Recurrent Deterministic Policy Gradient
☆10Updated 8 years ago
Alternatives and similar repositories for NMT-RDPG
Users that are interested in NMT-RDPG are comparing it to the libraries listed below
Sorting:
- ☆56Updated 6 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 9 years ago
- Differentiable neural computers☆27Updated 8 years ago
- Code for Emergent Translation in Multi-Agent Communication☆80Updated 7 years ago
- ☆92Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆96Updated 7 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Updated 5 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- TensorFlow implementation of the paper "Learning to learn by gradient descent by gradient descent ( https://arxiv.org/abs/1606.04474 )"☆84Updated 8 years ago
- A Quick and Dirty Progressive Neural Network written in TensorFlow.☆52Updated 7 years ago
- Benchmark and build RL architectures that can do multitask and transfer learning.☆143Updated 2 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)☆48Updated 7 years ago
- ☆36Updated 10 years ago
- Combining deep learning and reinforcement learning.☆80Updated 3 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- for learning reinforcement learning using PyTorch.☆64Updated 5 years ago
- RNNprop☆36Updated 8 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆147Updated 5 years ago
- The Statistical Recurrent Unit in Pytorch☆33Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 7 years ago
- ☆25Updated 7 years ago
- ☆14Updated 8 years ago
- Optimized Differentiable Neural Computer In Chainer☆23Updated 7 years ago
- Professor Forcing, NIPS'16☆45Updated 8 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Updated 8 years ago
- Asynchronous Advantage Actor Critic☆20Updated 8 years ago
- Deep generative model for sentiment analysis☆34Updated 8 years ago
- ☆49Updated 7 years ago