Code and models from the paper "Layer Normalization"
☆243Nov 8, 2016Updated 9 years ago
Alternatives and similar repositories for layer-norm
Users that are interested in layer-norm are comparing it to the libraries listed below
Sorting:
- Batch-Normalized LSTM (Recurrent Batch Normalization) implementation in Torch.☆90May 22, 2016Updated 9 years ago
- Review Network for Caption Generation☆181Jan 2, 2018Updated 8 years ago
- Mixed Incremental Cross-Entropy REINFORCE ICLR 2016☆333Mar 1, 2017Updated 9 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Mar 26, 2017Updated 8 years ago
- 2016 ActivityNet action recognition challenge. CNN + LSTM approach. Multi-threaded loading.☆53Jul 30, 2016Updated 9 years ago
- Structured Prediction Energy Networks in Torch☆132Feb 8, 2017Updated 9 years ago
- Weight initialisation schemes for Torch7 neural network modules☆100Jun 21, 2017Updated 8 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆174Nov 3, 2016Updated 9 years ago
- Learning RNN Hierarchies☆45Jun 22, 2016Updated 9 years ago
- ☆58May 26, 2016Updated 9 years ago
- PyTorch bindings for openai-gemm☆20Feb 6, 2017Updated 9 years ago
- Reinforcement learning environments for Torch7☆91Dec 15, 2016Updated 9 years ago
- Deep Networks with Stochastic Depth☆481Aug 13, 2018Updated 7 years ago
- Recurrent Highway Networks - Implementations for Tensorflow, Torch7, Theano and Brainstorm☆401Oct 9, 2019Updated 6 years ago
- Implements an efficient softmax approximation as described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/…☆396Mar 22, 2019Updated 6 years ago
- Torch7 implementation of Grid LSTM as described here: http://arxiv.org/pdf/1507.01526v2.pdf☆186Feb 10, 2016Updated 10 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention☆1,282Dec 30, 2020Updated 5 years ago
- Simple PuddleWorld DQN example using torch7☆29Jun 16, 2016Updated 9 years ago
- ByteNet for character-level language modelling☆318Aug 23, 2017Updated 8 years ago
- Learning What and Where to Draw☆336Nov 1, 2016Updated 9 years ago
- OptNet - Reducing memory usage in torch neural nets☆282Apr 19, 2017Updated 8 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆30Aug 30, 2016Updated 9 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…☆365Nov 22, 2018Updated 7 years ago
- Deterministic Policy Gradient using torch7☆43Jun 2, 2016Updated 9 years ago
- Fast Recurrent Networks Library☆578Sep 20, 2016Updated 9 years ago
- ☆53Mar 23, 2017Updated 8 years ago
- ☆69Dec 19, 2018Updated 7 years ago
- deep extensions to nn☆193May 26, 2017Updated 8 years ago
- An implementation of the deep convolutional generative adversarial network, combined with a varational autoencoder☆109Mar 18, 2017Updated 8 years ago
- ☆121Feb 28, 2017Updated 9 years ago
- Unsupervised learning of visual concepts from video☆56May 5, 2016Updated 9 years ago
- Hyper-parameter Optimization with DrMAD and Hypero☆23Jun 9, 2016Updated 9 years ago
- From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)☆42Nov 3, 2016Updated 9 years ago
- Implementation of a simple example of Q learning in Torch.☆51Mar 5, 2017Updated 8 years ago
- ☆167Aug 8, 2016Updated 9 years ago
- Torch implementations of various types of autoencoders☆476Aug 28, 2017Updated 8 years ago
- THE Deep Learning Benchmarks☆351Nov 2, 2016Updated 9 years ago
- Torch implementation of the Deep Network for Global Optimization (DNGO)☆51Jul 26, 2016Updated 9 years ago
- Tensorflow Implementation of Multi-Function Recurrent Unit☆23Jun 13, 2016Updated 9 years ago