MultiPath / NMT-RDPG
Neural machine translation with Recurrent Deterministic Policy Gradient
☆10Updated 8 years ago
Alternatives and similar repositories for NMT-RDPG:
Users that are interested in NMT-RDPG are comparing it to the libraries listed below
- This is my implementation of the Optimality Tightening☆37Updated 7 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- ☆25Updated 7 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Distributed Tensorflow Implementation of Asynchronous Methods for Deep Reinforcement Learning☆29Updated 7 years ago
- Collaborative Deep Reinforcement Learning☆32Updated 7 years ago
- ☆56Updated 6 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Asynchronous Advantage Actor Critic☆20Updated 8 years ago
- RNNprop☆36Updated 8 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆56Updated 7 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Updated 4 years ago
- Differentiable neural computers☆27Updated 8 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Updated 8 years ago
- ☆38Updated 8 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Updated 7 years ago
- Berkeley DeepRL Homework☆11Updated 7 years ago
- ☆28Updated 5 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Code for Emergent Translation in Multi-Agent Communication☆80Updated 6 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- Deterministic Policy Gradient using torch7☆43Updated 8 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆42Updated 7 years ago
- SeqGAN but with more bells and whistles☆24Updated 7 years ago
- ☆43Updated 5 years ago