MethodsOfMachineLearning / cabs
TensorFlow implementation of SGD with Coupled Adaptive Batch Size (CABS)
☆44, updated 8 years ago
Alternatives and similar repositories for cabs
Users interested in cabs are comparing it to the libraries listed below.
- DeepArchitect: Automatically Designing and Training Deep Architectures ☆145, updated 6 years ago
- Reproduction of some of the results from 'Identity Mappings in Deep Residual Networks' ☆72, updated 9 years ago
- Code for Attentive Recurrent Comparators ☆58, updated 8 years ago
- A new kind of pooling layer for faster and sharper convergence ☆76, updated 8 years ago
- Efficient layer normalization GPU kernel for TensorFlow ☆110, updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o… ☆149, updated 8 years ago
- Flattened convolutional neural networks (1D convolution modules for Torch nn) ☆61, updated 10 years ago
- DrMAD ☆107, updated 8 years ago
- Architecture learning for CNNs ☆37, updated 8 years ago
- DNI (Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch ☆29, updated 9 years ago
- NumPy implementation of Net2Net from the paper "Net2Net: Accelerating Learning via Knowledge Transfer" (http://arxiv.org/abs/1511.05641) ☆53, updated 9 years ago
- ☆35, updated 8 years ago
- Reference Caffe implementation of LSUV initialization ☆114, updated 8 years ago
- Lasagne code for weight normalization ☆88, updated 9 years ago
- A PyTorch implementation of "Self-Normalizing Neural Networks" by Klambauer et al. (still beta) ☆60, updated 8 years ago
- Reference implementation for Structured Prediction with Deep Value Networks ☆54, updated 8 years ago
- ☆69, updated 7 years ago
- Second-order optimiser for deep networks ☆76, updated 7 years ago
- GitHub repo for my experiments with the orthogonal convolution idea ☆22, updated 8 years ago
- Dynamic Capacity Networks using TensorFlow ☆52, updated 8 years ago
- ☆69, updated 8 years ago
- Tools to convert Caffe models to neon's serialization format ☆39, updated 3 years ago
- Distributed Learning by Pair-Wise Averaging ☆52, updated 8 years ago
- Source code for "Neural Networks with Few Multiplications", published at ICLR 2016 ☆80, updated 9 years ago
- TFStage: TensorFlow Project Scaffolding ☆62, updated 5 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets ☆28, updated 9 years ago
- ☆44, updated 8 years ago
- Fractional Max Pooling implementation in Theano ☆21, updated 10 years ago
- Wide residual network implementations. Best results on CIFAR-10 (97.12%), CIFAR-100 (84.12%), and other Kaggle challenges ☆37, updated 9 years ago
- Cluttered MNIST Dataset ☆53, updated 10 years ago