wojzaremba / trpo_rnnLinks
☆20Updated 9 years ago
Alternatives and similar repositories for trpo_rnn
Users that are interested in trpo_rnn are comparing it to the libraries listed below
Sorting:
- Deterministic Policy Gradient using torch7☆43Updated 9 years ago
- (Deprecated) See https://github.com/mrkulk/Unsupervised-Capsule-Network☆14Updated 10 years ago
- Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.☆43Updated 8 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆43Updated 9 years ago
- Learning RNN Hierarchies☆44Updated 9 years ago
- Topics on theoretical, mathematical aspects of DL☆72Updated 8 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Updated 8 years ago
- Unsupervised learning of visual concepts from video☆56Updated 9 years ago
- Simple PuddleWorld DQN example using torch7☆29Updated 9 years ago
- ☆19Updated 8 years ago
- Universal library for deep reinforcement learning.☆38Updated 9 years ago
- ☆38Updated 8 years ago
- ☆29Updated 8 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆29Updated 9 years ago
- ☆17Updated 8 years ago
- This is the implementation of paper Model Free Episodic Control☆36Updated 5 years ago
- Cluttered MNIST Dataset☆53Updated 10 years ago
- ☆15Updated 9 years ago
- Code for Attentive Recurrent Comparators☆57Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆69Updated 9 years ago
- Implementation of the reweighted wake-sleep machine learning algorithm☆42Updated 9 years ago
- RNNprop☆36Updated 8 years ago
- ☆32Updated 8 years ago
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆80Updated 8 years ago
- ☆11Updated 9 years ago
- [adversarial] examples and training cost☆19Updated 9 years ago
- Replication of the paper "Variational Dropout and the Local Reparameterization Trick" using Lasagne.☆33Updated 7 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Updated 9 years ago
- reinforcement learning. policy gradient. PCL☆37Updated 8 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 7 years ago