yaroslavvb / memory_utilLinks
TensorFlow util for building memory usage timeline from LOG_MEMORY messages
☆65Updated 7 years ago
Alternatives and similar repositories for memory_util
Users that are interested in memory_util are comparing it to the libraries listed below
Sorting:
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 8 years ago
- Lasagne code for weight normalization☆88Updated 9 years ago
- Reference caffe implementation of LSUV initialization☆114Updated 7 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆60Updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- Code for Attentive Recurrent Comparators☆57Updated 8 years ago
- started from Alex's code on google code☆43Updated 10 years ago
- Working Theano implementation of Pixel RNN on MNIST.☆76Updated 9 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆147Updated 5 years ago
- Code and models from the paper "Layer Normalization"☆245Updated 8 years ago
- Signal Processing Library for PyTorch☆39Updated 8 years ago
- a lightweight and simple logger for Machine Learning☆127Updated 4 years ago
- auto-tuning momentum SGD optimizer☆288Updated 6 years ago
- Reproduction of some of the results from 'Identity Mappings in Deep Residual Networks'☆72Updated 9 years ago
- A new kind of pooling layer for faster and sharper convergence☆76Updated 7 years ago
- ☆69Updated 6 years ago
- Accelerate Neural Net Training by Progressively Freezing Layers☆211Updated 7 years ago
- Chainer implementation of Wasserstein GAN☆96Updated 8 years ago
- Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"☆191Updated 7 years ago
- ☆92Updated 8 years ago
- Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆131Updated 7 years ago
- A tutorial on 'Soft weight-sharing for Neural Network compression' published at ICLR2017☆145Updated 8 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf)☆125Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 8 years ago
- Wasserstein DCGAN in Tensorflow/Keras☆93Updated 8 years ago
- Support powerful visual logging in PyTorch.☆103Updated 8 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆81Updated 9 years ago
- Python bindings for pyNVML and psutil library over network☆51Updated last year
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"☆65Updated 8 years ago
- ☆64Updated 8 years ago