wojzaremba / trpo_rnn
☆20Updated 8 years ago
Alternatives and similar repositories for trpo_rnn:
Users that are interested in trpo_rnn are comparing it to the libraries listed below
- Deterministic Policy Gradient using torch7☆43Updated 8 years ago
- (Deprecated) See https://github.com/mrkulk/Unsupervised-Capsule-Network☆14Updated 9 years ago
- ☆17Updated 7 years ago
- Topics on theoretical, mathematical aspects of DL☆72Updated 8 years ago
- Universal library for deep reinforcement learning.☆38Updated 9 years ago
- Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.☆43Updated 8 years ago
- Learning RNN Hierarchies☆44Updated 8 years ago
- Asynchronous Advantage Actor Critic☆20Updated 8 years ago
- Fun with variational autoencoders.☆11Updated 7 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆42Updated 9 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Updated 8 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18Updated 7 years ago
- Natural Gradient implementation in Theano☆19Updated 12 years ago
- Reinforcement learning environments for Torch7☆91Updated 8 years ago
- ☆18Updated 9 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Updated 9 years ago
- torch TH/THC c++11 wrapper☆14Updated 7 years ago
- ☆11Updated 8 years ago
- ☆30Updated 8 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO)☆51Updated 8 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- A rudimentary wrapper around the fast Maxwell kernels for GEMM and convolution operations provided by nervanagpu☆34Updated 9 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- Simple PuddleWorld DQN example using torch7☆29Updated 8 years ago
- Implementations of differentiable stacks, queues, and deques from "Learning to Transduce with Unbounded Memory"☆20Updated 9 years ago
- ☆38Updated 8 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Updated 9 years ago
- ☆19Updated 9 years ago
- Implementation of the reweighted wake-sleep machine learning algorithm☆41Updated 8 years ago
- Reasonably-okay-performing implementation of a GAN and an adversarial autoencoder on MNIST.☆29Updated 9 years ago