wojzaremba / trpo_rnn
☆19Updated 8 years ago
Alternatives and similar repositories for trpo_rnn:
Users that are interested in trpo_rnn are comparing it to the libraries listed below
- Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.☆43Updated 8 years ago
- Learning RNN Hierarchies☆44Updated 8 years ago
- Deterministic Policy Gradient using torch7☆43Updated 8 years ago
- ☆17Updated 7 years ago
- Asynchronous Advantage Actor Critic☆20Updated 8 years ago
- Topics on theoretical, mathematical aspects of DL☆71Updated 8 years ago
- ☆19Updated 8 years ago
- Natural Gradient implementation in Theano☆19Updated 11 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 7 years ago
- A rudimentary wrapper around the fast Maxwell kernels for GEMM and convolution operations provided by nervanagpu☆34Updated 9 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO)☆51Updated 8 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular☆52Updated 7 years ago
- ☆29Updated 7 years ago
- ☆30Updated 7 years ago
- (Deprecated) See https://github.com/mrkulk/Unsupervised-Capsule-Network☆14Updated 9 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Updated 9 years ago
- Universal library for deep reinforcement learning.☆38Updated 8 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 8 years ago
- some RL algorithms☆19Updated 8 years ago
- RNNprop☆36Updated 8 years ago
- Asynchronous One Step Q Learning implemented with MXNET☆20Updated 8 years ago
- Experiment files for the paper "An Analysis of Unsupervised Pre-training in Light of Recent Advances", available here: http://arxiv.org/a…☆18Updated 9 years ago
- ☆32Updated 8 years ago
- ☆15Updated 8 years ago
- Code for the "Binding via Reconstruction Clustering" paper☆21Updated 9 years ago
- [adversarial] examples and training cost☆19Updated 8 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Updated 8 years ago
- ☆18Updated 9 years ago
- Reasonably-okay-performing implementation of a GAN and an adversarial autoencoder on MNIST.☆29Updated 9 years ago
- ACDC: A Structured Efficient Linear Layer☆43Updated 9 years ago