JianGoForIt / YellowFin_Pytorch
auto-tuning momentum SGD optimizer
★288 · Updated 6 years ago
Alternatives and similar repositories for YellowFin_Pytorch
Users interested in YellowFin_Pytorch are comparing it to the libraries listed below.
- Tools for PyTorch ★222 · Updated 2 years ago
- Implementation of Using Fast Weights to Attend to the Recent Past. ★269 · Updated 6 years ago
- Efficient layer normalization GPU kernel for Tensorflow ★111 · Updated 8 years ago
- Reference caffe implementation of LSUV initialization ★114 · Updated 7 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100 ★295 · Updated 6 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258) ★172 · Updated 8 years ago
- Compare SELUs (scaled exponential linear units) with other activations on MNIST, CIFAR10, etc. ★375 · Updated 7 years ago
- Code and models from the paper "Layer Normalization" ★245 · Updated 8 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures ★147 · Updated 5 years ago
- Supporting public code for SIGBOVIK17 submission ★197 · Updated 8 years ago
- Accelerate Neural Net Training by Progressively Freezing Layers ★211 · Updated 6 years ago
- Lasagne code for weight normalization ★88 · Updated 9 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw… ★364 · Updated 6 years ago
- Deep Unsupervised Perceptual Grouping ★131 · Updated 4 years ago
- Fully differentiable deep-neural decision forest in tensorflow ★229 · Updated 7 years ago
- Batch normalized LSTM for tensorflow ★179 · Updated 8 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators ★146 · Updated 7 years ago
- Decoupled Neural Interfaces using Synthetic Gradients for PyTorch ★238 · Updated 6 years ago
- Observations and notes to understand the workings of neural network models and other thought experiments using Tensorflow ★202 · Updated 5 years ago
- Code for "On the Effects of Batch and Weight Normalization in Generative Adversarial Networks" ★181 · Updated 7 years ago
- DrMAD ★107 · Updated 7 years ago
- OptNet - Reducing memory usage in torch neural nets ★283 · Updated 8 years ago
- This is the project for LS-GAN (Loss-Sensitive GAN) ★213 · Updated 8 years ago
- Capsule network with variations. Originally proposed by Tieleman & Hinton: http://www.cs.toronto.edu/~tijmen/tijmen_thesis.pdf ★170 · Updated 7 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o… ★149 · Updated 8 years ago
- Implementation of http://arxiv.org/abs/1511.05641 that lets one build a larger net starting from a smaller one. ★159 · Updated 8 years ago
- a lightweight and simple logger for Machine Learning ★127 · Updated 4 years ago
- FractalNet implementation in Keras: Ultra-Deep Neural Networks without Residuals ★156 · Updated 7 years ago
- Generative Adversarial Networks with Keras ★156 · Updated 4 years ago
- ★137 · Updated 7 years ago