JianGoForIt / YellowFin_Pytorch
auto-tuning momentum SGD optimizer
☆288 Updated 6 years ago
Alternatives and similar repositories for YellowFin_Pytorch
Users who are interested in YellowFin_Pytorch are comparing it to the libraries listed below.
- Tools for PyTorch ☆223 Updated 3 years ago
- 2.86% and 15.85% on CIFAR-10 and CIFAR-100 ☆297 Updated 7 years ago
- Code and models from the paper "Layer Normalization" ☆244 Updated 9 years ago
- Compare SELUs (scaled exponential linear units) with other activations on MNIST, CIFAR10, etc. ☆375 Updated 8 years ago
- 🏃 Implementation of Using Fast Weights to Attend to the Recent Past. ☆270 Updated 6 years ago
- Reference caffe implementation of LSUV initialization ☆114 Updated 8 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw… ☆367 Updated 7 years ago
- DeepArchitect: Automatically Designing and Training Deep Architectures ☆145 Updated 6 years ago
- OptNet - Reducing memory usage in torch neural nets ☆282 Updated 8 years ago
- Efficient layer normalization GPU kernel for Tensorflow ☆110 Updated 8 years ago
- Supporting public code for SIGBOVIK17 submission ☆198 Updated 8 years ago
- Adversarially Learned Inference ☆311 Updated 7 years ago
- Lasagne code for weight normalization ☆88 Updated 9 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258) ☆172 Updated 9 years ago
- Batch normalized LSTM for tensorflow ☆178 Updated 9 years ago
- DrMAD ☆107 Updated 8 years ago
- Deep Unsupervised Perceptual Grouping ☆132 Updated 5 years ago
- This is the project for LS-GAN (Loss-Sensitive GAN) ☆215 Updated 8 years ago
- Decoupled Neural Interfaces using Synthetic Gradients for PyTorch ☆239 Updated 6 years ago
- Accelerate Neural Net Training by Progressively Freezing Layers ☆212 Updated 7 years ago
- Cleaned original source code from my NIPS publication ☆158 Updated 8 years ago
- The first public PyTorch implementation of Attentive Recurrent Comparators ☆146 Updated 8 years ago
- Observations and notes to understand the workings of neural network models and other thought experiments using Tensorflow ☆201 Updated 6 years ago
- a lightweight and simple logger for Machine Learning ☆127 Updated 5 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o… ☆149 Updated 8 years ago
- Standalone TensorBoard for visualizing in deep learning ☆371 Updated 5 years ago
- supercell ☆192 Updated 8 years ago
- Code for "On the Effects of Batch and Weight Normalization in Generative Adversarial Networks" ☆181 Updated 7 years ago
- Generative Adversarial Networks with Keras ☆156 Updated 5 years ago
- Implementation of http://arxiv.org/abs/1511.05641 that lets one build a larger net starting from a smaller one. ☆158 Updated 9 years ago