TimSalimans / weight_normLinks
Lasagne code for weight normalization
☆88Updated 9 years ago
Alternatives and similar repositories for weight_norm
Users that are interested in weight_norm are comparing it to the libraries listed below
Sorting:
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 8 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆60Updated 8 years ago
- Reference caffe implementation of LSUV initialization☆114Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 8 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf)☆125Updated 7 years ago
- Reproduction of some of the results from 'Identity Mappings in Deep Residual Networks'☆72Updated 9 years ago
- Working Theano implementation of Pixel RNN on MNIST.☆76Updated 9 years ago
- Code and models from the paper "Layer Normalization"☆244Updated 8 years ago
- Code for Attentive Recurrent Comparators☆57Updated 8 years ago
- Batch-Normalized LSTM (Recurrent Batch Normalization) implementation in Torch.☆90Updated 9 years ago
- ☆69Updated 6 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆29Updated 9 years ago
- Implementation of http://arxiv.org/abs/1511.05641 that lets one build a larger net starting from a smaller one.☆158Updated 8 years ago
- Implementation of Adversarial Autoencoder with Theano☆40Updated 9 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆146Updated 6 years ago
- ☆121Updated 8 years ago
- Wasserstein DCGAN in Tensorflow/Keras☆93Updated 8 years ago
- Fractional Max Pooling implementation in Theano☆21Updated 10 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆172Updated 8 years ago
- Chainer implementation of Wasserstein GAN☆96Updated 8 years ago
- ☆69Updated 8 years ago
- Generalized Loss-Sensitive Generative Adversarial Networks (GLS-GAN)☆46Updated 8 years ago
- auto-tuning momentum SGD optimizer☆288Updated 6 years ago
- ☆64Updated 8 years ago
- Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆130Updated 7 years ago
- Recurrent Models of Visual Attention (RAM) with Chainer☆44Updated 8 years ago
- DrMAD☆107Updated 7 years ago
- visualizing what ConvNets learn with camera☆88Updated 10 years ago
- ImageNet training using torch☆101Updated 8 years ago