JianGoForIt / YellowFin_PytorchLinks
auto-tuning momentum SGD optimizer
☆288Updated 6 years ago
Alternatives and similar repositories for YellowFin_Pytorch
Users that are interested in YellowFin_Pytorch are comparing it to the libraries listed below
Sorting:
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100☆297Updated 7 years ago
- Tools for PyTorch☆223Updated 3 years ago
- Reference caffe implementation of LSUV initialization☆114Updated 8 years ago
- 🏃 Implementation of Using Fast Weights to Attend to the Recent Past.☆270Updated 6 years ago
- Code and models from the paper "Layer Normalization"☆244Updated 9 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…☆367Updated 7 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures☆145Updated 6 years ago
- Compare SELUs (scaled exponential linear units) with other activations on MNIST, CIFAR10, etc.☆375Updated 8 years ago
- Batch normalized LSTM for tensorflow☆178Updated 9 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆172Updated 9 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators☆146Updated 8 years ago
- Supporting public code for SIGBOVIK17 submission☆198Updated 8 years ago
- Efficient layer normalization GPU kernel for Tensorflow☆110Updated 8 years ago
- OptNet - Reducing memory usage in torch neural nets☆282Updated 8 years ago
- Accelerate Neural Net Training by Progressively Freezing Layers☆212Updated 7 years ago
- Decoupled Neural Interfaces using Synthetic Gradients for PyTorch☆239Updated 6 years ago
- Forward-mode Automatic Differentiation for TensorFlow☆139Updated 7 years ago
- Lasagne code for weight normalization☆88Updated 9 years ago
- ☆138Updated 8 years ago
- DrMAD☆107Updated 8 years ago
- Capsule network with variations. Originally proposed by Tieleman & Hinton : http://www.cs.toronto.edu/~tijmen/tijmen_thesis.pdf☆168Updated 8 years ago
- Adversarially Learned Inference☆311Updated 7 years ago
- Deep Unsupervised Perceptual Grouping☆132Updated 5 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- Fully differentiable deep-neural decision forest in tensorflow☆229Updated 8 years ago
- a lightweight and simple logger for Machine Learning☆127Updated 5 years ago
- This is the project for LS-GAN (Loss-Sensitive GAN)☆215Updated 8 years ago
- 📈 TensorFlow + Matplotlib as TF ops☆299Updated 6 years ago
- FractalNet implementation in Keras: Ultra-Deep Neural Networks without Residuals☆157Updated 8 years ago
- Code for paper "L4: Practical loss-based stepsize adaptation for deep learning"☆124Updated 6 years ago