ryankiros / layer-norm
Code and models from the paper "Layer Normalization"
☆244 · Updated 8 years ago
Alternatives and similar repositories for layer-norm:
Users who are interested in layer-norm are comparing it to the libraries listed below.
- TensorFlow implementation of normalizations such as Layer Normalization, HyperNetworks. ☆111 · Updated 8 years ago
- Benchmarks for several RNN variations with different deep-learning frameworks ☆169 · Updated 5 years ago
- Efficient layer normalization GPU kernel for Tensorflow ☆110 · Updated 7 years ago
- ☆167 · Updated 8 years ago
- auto-tuning momentum SGD optimizer ☆287 · Updated 5 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258) ☆172 · Updated 8 years ago
- Implementation of http://arxiv.org/abs/1511.05641 that lets one build a larger net starting from a smaller one. ☆159 · Updated 8 years ago
- Language Modeling ☆156 · Updated 5 years ago
- Study of HeXA@UNIST in Preparation for Submission ☆106 · Updated 8 years ago
- ☆121 · Updated 7 years ago
- Recreating the Deep Residual Network in Lasagne ☆118 · Updated 8 years ago
- ☆137 · Updated 7 years ago
- ByteNet for character-level language modelling ☆318 · Updated 7 years ago
- ☆64 · Updated 7 years ago
- ☆165 · Updated 8 years ago
- Batch normalized LSTM for tensorflow ☆179 · Updated 8 years ago
- Working Theano implementation of Pixel RNN on MNIST. ☆76 · Updated 8 years ago
- Torch implementation of seq2seq machine translation with GRU RNN and attention ☆77 · Updated 8 years ago
- Mixed Incremental Cross-Entropy REINFORCE ICLR 2016 ☆332 · Updated 7 years ago
- ☆143 · Updated 7 years ago
- Batch-Normalized LSTM (Recurrent Batch Normalization) implementation in Torch. ☆91 · Updated 8 years ago
- Implementations of "LSTM: A Search Space Odyssey" variants and their training results on the PTB dataset. ☆95 · Updated 7 years ago
- Torch7 implementation of Grid LSTM as described here: http://arxiv.org/pdf/1507.01526v2.pdf ☆187 · Updated 9 years ago
- Review Network for Caption Generation ☆181 · Updated 7 years ago
- ☆88 · Updated 8 years ago
- Lasagne code for weight normalization ☆87 · Updated 8 years ago
- End-To-End Memory Networks in Theano ☆130 · Updated 2 years ago
- Deep Unsupervised Perceptual Grouping ☆131 · Updated 4 years ago
- Examples and scripts using Blocks ☆147 · Updated 8 years ago
- Weight initialisation schemes for Torch7 neural network modules ☆100 · Updated 7 years ago