yaroslavvb / memory_utilLinks
TensorFlow util for building memory usage timeline from LOG_MEMORY messages
☆65Updated 7 years ago
Alternatives and similar repositories for memory_util
Users that are interested in memory_util are comparing it to the libraries listed below
Sorting:
- Efficient layer normalization GPU kernel for Tensorflow☆111Updated 8 years ago
- Lasagne code for weight normalization☆88Updated 9 years ago
- train on AWS☆75Updated 6 years ago
- Reference caffe implementation of LSUV initialization☆114Updated 7 years ago
- A pytorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta)☆59Updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- Python bindings for pyNVML and psutil library over network☆51Updated last year
- ☆92Updated 8 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆147Updated 5 years ago
- Signal Processing Library for PyTorch☆39Updated 8 years ago
- Code for Attentive Recurrent Comparators☆57Updated 8 years ago
- easy embeddable Torch7 networks☆35Updated 8 years ago
- ☆69Updated 6 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- Source code for ``Neural Networks with Few Multiplications'' published at ICLR 2016☆81Updated 9 years ago
- A new kind of pooling layer for faster and sharper convergence☆76Updated 7 years ago
- Basic library that can run networks created with Torch☆174Updated 5 years ago
- A more memory efficient Torch implementation of "Densely Connected Convolutional Networks".☆29Updated 8 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆29Updated 8 years ago
- A Python module for compiling PyTorch graphs to C☆91Updated 7 years ago
- Small Python library to automatically set CUDA_VISIBLE_DEVICES to the least loaded device on multi-GPU systems.☆107Updated 2 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- A tutorial on 'Soft weight-sharing for Neural Network compression' published at ICLR2017☆145Updated 8 years ago
- Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"☆191Updated 7 years ago
- Tensorflow Implementation on "The Cramer Distance as a Solution to Biased Wasserstein Gradients" (https://arxiv.org/pdf/1705.10743.pdf)☆125Updated 7 years ago
- Reproduction of some of the results from 'Identity Mappings in Deep Residual Networks'☆72Updated 8 years ago
- started from Alex's code on google code☆43Updated 10 years ago
- ☆29Updated 8 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 7 years ago
- Weight initialization schemes for PyTorch nn.Modules☆70Updated 8 years ago